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(57) Abstract 

The present invention relates to novel SH3 domain binding protein, referred to herein as a DEP polypeptides. The DEP polypeptides 
comprise several motifs including a arc SH3 consensus binding sequence, four ankyrin repeats, one zinc finger domain and six copies of 
a proline-rich tandem repeat DEP polypeptides may function as mediators of SH3 domain-dependent signal transduction pathways and, 
thus may mediate multiple signaling events such as cellular gene expression, cytoskeletal architecture, protein trafficking and endocytosis, 
cell adhesion, migration, proliferation and differentiation. Described herein are isolated and antisense nucleic acids molecules, recombinant 
expression vectors, host cells and non-human transgenic animals containing an insertion or a disruption of the DEP gene. Diagnostic, 
screening and therapeutic methods utilizing the compositions of the invention are also provided. 
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DIFFERENTIATION ENHANCING FACTORS AND USES THEREFOR 

Background of the Invention 

Cellular interactions can be viewed as proceeding in two steps. Initially, 
5 an extracellular molecule binds to a specific receptor on a target cell, converting 
the dormant receptor to an active state. Subsequently, the receptor stimulates 
intracellular biochemical pathways leading to a cellular response, which may 
involve progression through the cell cycle, as well as changes in cellular gene 
expression, cytoskeletal architecture, protein trafficking, endocytosis, cell 

10 adhesion, migration, proliferation and differentiation, among others. An 
intracellular biochemical pathway which mediates some of these cellular 
responses involves members of the c-src family of protein tyrosine kinases, such 
as ipp6(f' src . Src tyrosine kinases transduce extracellular signals as diverse as 
responses to growth factors (for example, platelet derived growth factor (PDGF), 

15 epidermal growth factor (EGF)), antigens, cytokines, extracellular matrix 

molecules, among others. These extracellular signals give rise to a myriad of 
cellular responses, such as mitotic function, activation of Ras dependent 
pathways, phosphatidyl inositol 3-kinase activation and cytoskeletal 
reorganization. 

20 The amino terminus of pp60 c " jrc contains two motifs of approximately 

100 and 60 amino acids in length named Src homology 2 and 3 domains (SH2, 
SH3), respectively. SH2 and SH3 domains have been identified in numerous 
signal transduction proteins (Pawson, T. and J. Schlessinger (1993) J. Curr. Bio. 
3:434-442; Courtneidge et al. (1994) Trends Cell Biol 4:345-347; Pawson, T. 

25 (1995) Nature 373: 573-580). These domains presumably function as modular 
units that interact with other signal transduction proteins. The importance of 
SH2 and SH3 domains in signal transduction is underscored by the identification 
of "adapter proteins", such as c-crk (Reichman et al., 1992), c-nck (Chou et al., 
1992) and grb-2/ASH (Margolis et al., 1992; Matuoka et al., 1992), which lack a 

30 catalytic domain, and thus, appear to function as adaptors between membrane 
signaling and multiple downstream targets. 

Proteins containing SH2 domains control biochemical pathways as 
diverse as phospholipid metabolism, tyrosine phosphorylation and 
dephosphorylation, activation of Ras-like GTPases, gene expression, protein 

35 trafficking and cytoskeletal architecture (Pawson, T. and J. Schlessinger (1993) 
J. Curr. Bio. 3:434-442). In vivo, SH2-containing proteins bind to 
phosphotyrosine (pTyr)-containing sites on activated receptors and cytoplasmic 
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phosphoproteins (Anderson et al. (1990) Science 250:979-982; Matsuda et al. 
(1990) Science 248:1537-1539; Valius, M. and A. Kazlauskas (1993) Cell 
73:321-334). Indeed, crystal structures of the SH2 domains show a pocket 
configuration of amino acids that interact directly with a phosphotyrosine residue 
5 of an associated protein. Based on the crystal structure, the amino acid residues 
adjacent to the residues in direct contact with the phosphotyrosine determine the 
specificity of the interaction (Waksman et al. (1993) J. Cell 72:779-790; Lee et 
al. (1994) Structure 2:423-438). 

SH3 domains have been found in a number of proteins involved in 

10 tyrosine kinase signaling, but also in cytoskeletal components and subunits of the 
neutrophil cytochrome oxidase, among others (Drubin et al. (1990) Nature 
343:288-290; Leto et al. (1990) Science 248:727-730). In contrast to SH2 
domains which interact with phosphorylated tyrosine residues of an associated 
protein, phosphorylation does not appear to be necessary for a protein to interact 

15 with a SH3 domain. The first SH3 binding protein identified, 3bp-l, shows 
homology to rho GTPase activating protein (GAP) (Cicchetti et al., (1992) 
Science 257:803). C3G was initially identified as a GTP exchange factor for 
several G proteins, and was subsequently shown to have affinity for the SH3 
domains of Crk and Grb-2 (Tanaka et al. (1994) Proa Natl Acad. Sci USA 

20 91:3443-3447). G proteins themselves may be the targets for the binding of SH3 
containing proteins. As an illustration, the proline rich C-terminus of the brain 
specific form of dynamin binds to several SH3 domains including those found in 
pp60<^ ^ ppS^-^ but not pp 5sc~fgr ( Gou t et al., 1993; Seedorf et al. 
(1994) 7. Biol Chem. 269:16009-16014). Dynamin is a microtubule-associated 

25 GTPase that is involved in endocytosis (Takel et al., 1995; Hinshaw et al., 1995). 
The binding of a SH3 domain to dynamin results in an increase in intrinsic 
GTPase activity (Gout et al., 1 993). 

SH3-binding sites consist of proline-rich peptides of approximately 1 0 
amino acids (Ren et al. (1993) Science 259:1 157-1 161; Yu et al. (1994) Cell 

30 76:933-945), which bind to isolated SID domains with dissociation constants of 
5-100 (ref. 25). Recent structural and mutagenic analysis of peptide-SH3 
complexes (Feng et al. (1994) Science 266:1241-1246; Lim et al. (1994) Nature 
372:375-379; Musacchio et al. (1994) Nature Struct. Biol 1:546-551 ; Wittekind 
et al. (1994) Biochemistry 33:13531-13539; Rickles et al. (1994) EMBOJ. 

35. 13:5598-5604) shows that peptides associated with SH3 domains adopt a left- 
handed polyproline type II helix, with three residues per turn, as illustrated by a 
PXXP consensus sequence (P=Proline, X=any amino acid) that forms a 
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polyproline type II helix (Yu et al. (1994) Cell 76:933-945). Solution and crystal 
structures of SH3 domains complexed with small peptides indicate a groove in 
the SH3 domain where the prolines of the PXXP helix are situated (Lim et al. 
(1994) Nature 372:375-379; Yu et al. (1994) Cell 76:933-945; Musacchio et al. 

5 (1 994) Nature Struct Biol 1:546-55 1 ). Residues adjacent to the prolines also 
form contacts within the SID sequence and these interactions determine the 
specificity between a protein and a particular SH3 domain. For example, the 
arginine in "RPLPXXP" forms a salt bridge with aspartate at position 99 of 
ppeO 0 "^. However the C-terminal arginine in the sequence " AFAPPLPRR" 

1 0 contacts the identical aspartate in pp60 c " %src , indicating that proteins may interact 
with SH3 domains in either a "plus" or "minus" orientation (named "class I" and 
"class II" binding, respectively; Yu et al. (1994) Science 258:1665; Lim et al. 
(1994) Nature 372:375-379). 

Several proteins that interact with the SID domains of src -family kinases 

1 5 have been shown to be implicated in cellular growth. These include the 

regulatory subunit of phosphatidyl-inositol-3-kinase, p85 (Prasad et al. (1993) 
Proa Natl Acad ScL USA 91:2834-2838), SHC (Weng et al., 1994), and ras 
GTPase-activating protein (Briggs et al., 1995). Furthermore, mutants within the 
SH3 domains of the adapter proteins c-crk and grb-2 inhibit v-abl oncogenic 

20 activity presumably by acting as "dominant negative" signal transduction 
effectors (Tanaka et al. (1995) Proc. Natl Acad Set USA 91 :3443-3447). 

Despite much progress in characterizing the signal trasnduction pathways 
involving SH3 domains, there is a great need for identifying novel mediators of 
these pathways, and in particular, binding proteins that interact with these SID 

25 domains. The identification of these novel molecules may provide for a detailed 
analysis of the amino acid contacts that determine the binding affinity and 
specificity of SH3 domains with an associated protein, which may in turn 
facilitate the development of therapeutic agents to be used in treating a diverse 
number of disorders. 

30 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of 
nucleic acid molecules which encode a novel family of src SH3 binding proteins, 
referred to herein as "differentiation enhancing factors" or "DEF polypeptides". 
35 The DEF molecules show a highly conserved N-terminal domain and divergent 
C-terminus. The N-terminal domain preferably includes several structural motifs 
such as at least one src SID consensus binding sequence, at least one, and 
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preferably four ankyrin repeats, at least one zinc finger domain, at least one 
pleckstrin homology domain and at least one C2 domain. The C-terminal 
domain diverges between family members, and may include at least one, and 
preferably three, more preferably six copies of a proline-rich tandem repeat and 
5 an SH3 domain. In one embodiment, DEF molecules of the invention are 
cytoplasmic proteins which function as mediators of signal transduction 
pathways of, for example, SH3 domain containing molecules, thus mediating 
multiple events including gene expression, cytoskeletal architecture, protein 
trafficking and endocytosis, cell adhesion, migration, proliferation and 

1 0 differentiation. In a preferred embodiment, DEF molecules of the invention 
modulate the differentiation of precursor cells, e.g., adipose or neural precursor 
cells. The DEF molecules of the invention may therefor be useful in the 
treatment of disorders, for example, hyperplastic and neoplastic tissues. 

In one aspect, the invention provides isolated nucleic acid molecules 

1 5 encoding a DEF polypeptide. Such nucleic acid molecules (e.g., cDNAs) have a 
nucleotide sequence encoding a DEF polypeptide or biologically active portions 
thereof, such as a polypeptide having one or more of the following 
characteristics: the ability to bind to an SH3 domain in an intra- or 
intermolecular interaction; the ability to dimerize with like molecules or other 

20 molecules; the ability to anchor cytoskeletal elements to the plasma membrane; 
the ability to modulate the activity of signal transduction molecules, e.g., kinase 
activity, e.g., p38 MAP kinase activity, or G protein activity, e.g., GTPase 
activity; the ability to synergize with the activity of peroxisome proliferator 
activated receptor y (PPARy); the ability to induce expression of PPARy; the 

25 ability to induce the terminal differentiation of a hyperproliferative cell; or the 
ability to induce adipogenesis or neurogenesis. In a preferred embodiment, the 
isolated nucleic acid molecule has a nucleotide sequence shown in Figure 2, SEQ 
ID NO: 1; Figure 13, (SEQ ID NO: 3 or SEQ ID NO: 5); Figure 14, (SEQ ID 
NO: 6 or SEQ ID NO: 8); or Figure 15, (SEQ ID NO: 9 or SEQ ID NO: 1 1), or a 

30 portion thereof such as the coding region of the nucleotide sequence of Figure 2, 
SEQ ID NO: 1; Figure 13, (SEQ ID NO: 3); Figure 14, (SEQ ID NO: 6); or 
Figure 15, (SEQ ID NO: 9). Other preferred nucleic acid molecules encode a 
protein having the amino acid sequence of Figure 3, SEQ ID NO: 2; or Figure 
12, (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). Nucleic acid molecules 

35 derived from a mammalian, preferably, a human cell (e.g., a naturally-occurring 
nucleic acid molecule found in a mammalian brain or an adipocyte cell) which 
hybridize under stringent conditions to the nucleotide sequence shown in Figure 
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2, SEQ ID NO: 1; Figure 13, (SEQ ID NO: 3 or SEQ ID NO: 5); Figure 14, 
(SEQ ID NO: 6 or SEQ ID NO: 8); or Figure 15, (SEQ ID NO: 9 or SEQ ID NO: 
1 1) are also within the scope of the invention. 

In another embodiment, the isolated nucleic acid molecule is a nucleotide 
5 sequence encoding a protein having an amino acid sequence which is at least 
about 80%, preferably at least about 85%, more preferably at least about 90% 
and most preferably at least about 95-99% overall amino acid sequence identity 
with an amino acid sequence shown in Figure 3, SEQ ID NO: 2; or Figure 12, 
(SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 1 0). This invention further 

1 0 pertains to nucleic acid molecules which encode a protein which includes one or 
more of the following: at least one SH3 consensus binding sequence having an 
amino acid sequence at least 80%, preferably at least 90%, more preferably at 
least 95-99% identical to an amino acid sequence shown in Figure 3, SEQ ID 
NO: 2; or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10); at 

15 least one ankyrin repeat, preferably two or three, and most preferably four 

ankyrin repeats, having an amino acid sequence at least 80%, preferably at least 
90%, more preferably at least 95-99% identical to an amino acid sequence shown 
in Figure 3, SEQ ID NO: 2; or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or 
SEQ ID NO: 10); a zinc finger domain having an amino acid sequence at least 

20 80%, preferably at least 90%, more preferably at least 95-99% identical to an 
amino acid sequence shown in Figure 3, SEQ ID NO: 2; or Figure 12, (SEQ ID 
NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10); a pleckstrin homology domain 
having an amino acid sequence at least 80%, preferably at least 90%, more 
preferably at least 95-99% identical to an amino acid sequence shown in Figure 

25 3, SEQ ID NO: 2; or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 
10); and a C2 domain having an amino acid sequence at least 80%, preferably at 
least 90%, more preferably at least 95-99% identical to an amino acid sequence 
shown in Figure 3, SEQ ID NO: 2; or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, 
or SEQ ID NO: 1 0). Further within the scope of this invention are nucleic acid 

30 molecules which encode a protein which includes a proline-rich repeat having an 
amino acid sequence at least 80%, preferably at least about 90%, more preferably 
at least about 95-99% identical to an amino acid sequence shown in Figure 3, 
SEQ ID NO: 2. This invention also encompasses nucleic acid molecules which 
encode a protein which includes an SH3 domain having an amino acid sequence 

35 at least about 80%, preferably at least about 90%, more preferably at least about 
95-99% identical to an amino acid sequence shown in Figure 3, SEQ ID NO: 2 
or Figure 1 2, SEQ ID NO: 4 or SEQ ID NO: 7. 
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Nucleic acid molecules encoding proteins which include one or more of 
the following: at least one SH3 consensus binding sequence having an amino 
acid sequence at least about 60% (preferably at least about 70%, 80%, 90%, or 
95-99%) identical to an amino acid sequence shown in Figure 3, SEQ ID NO: 2, 
5 or Figure 12, SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10; at least one 
ankyrin repeat having an amino acid sequence at least about 60% (preferably at 
least about 70%, 80%, 90%, or 95-99%) identical to an amino acid sequence 
shown in Figure 3, SEQ ID NO: 2, or Figure 12, SEQ ID NO: 4, SEQ ID NO: 7, 
or SEQ ID NO: 10, a zinc finger domain having an amino acid sequence at least 
1 0 about 60% (preferably at least about 70%, 80%, 90%, or 95-99%) identical to an 
amino acid sequence shown in Figure 3, SEQ ID NO: 2 or Figure 12, SEQ ID 
NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10, a pleckstrin homology domain having 
an amino acid sequence at least 60% (preferably at least about 70%, 80%, 90%, 
or 95-99%) identical to an amino acid sequence shown in Figure 3, SEQ ID NO: 
15 2; or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10), a C2 
domain having an amino acid sequence at least about 60% (preferably at least 
about 70%, 80%, 90%, or 95-99%) identical to an amino acid sequence shown in 
Figure 3, SEQ ID NO: 2; or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ 
ID NO: 1 0); a proline-rich repeat having an amino acid sequence at least about 
20 60% (preferably at least about 70%, 80%, 90%, or 95-99%) identical to an amino 
acid sequence shown in Figure 3, SEQ ID NO: 2, and an SH3 domain having an 
amino acid sequence at least about 60% (preferably at least about 70%, 80%, 
90%, or 95-99%) identical to an amino acid sequence shown in Figure 3, SEQ ID 
NO: 2 or Figure 12, SEQ ID NO: 4 or SEQ ID NO: 7, are also within the scope 
25 of this invention. 

Another aspect of this invention pertains to nucleic acid molecules 
encoding a DEF polypeptide fusion protein which includes a nucleotide sequence 
encoding a first peptide having an amino acid sequence at least about 80% 
(preferably at least about 90%, or 95-99%) identical to an amino acid sequence 
30 shown in Figure 3, SEQ ID NO: 2 or Figure 12, SEQ ID NO: 4, SEQ ID NO: 7, 
or SEQ ID NO: 10, and a nucleic sequence encoding a second peptide 
corresponding to a moiety that facilitates detection or purification or alters the 
solubility of this fusion protein, such as glutathione-S-transferase, or an 
enzymatic activity such as alkaline phosphatase, or an epitope tag. 
35 In another embodiment, the isolated nucleic acid molecule is a nucleotide 

sequence encoding a polypeptide fragment of at least about 5, 10, 15, 20, 25, 30, 
40, 50, 60, 70, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 
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750, 800, 850-1 125 amino acid residues in length, preferably at least about 5-250 
amino acid residues in length, and more preferably at least about 10-200 amino 
acid residues in length corresponding to a protein having at least about 80% the 
amino acid sequence shown in Figure 3, (SEQ ID NO: 2) or Figure 12, SEQ ID 
5 NO: 4, SEQ ID NO: 7, or SEQ ID NO: 1 0. In a preferred embodiment, the 
polypeptide fragment has a DEF activity, e.g., induces adipogenesis or 
neurogenesis. 

Moreover, given the disclosure herein of a DEF polypeptide-encoding 
cDNA sequence (e.g., SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 

10 NO: 6, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 1 1), antisense nucleic acid 
molecules (i.e, molecules which are complimentary to the coding strand of the 
DEF polypeptide cDNA sequence) are also provided by the invention. 
Accordingly, the DEF nucleic acid molecule can be non-coding, (e.g., probe, 
antisense or ribozyme molecules) or can encode a functional DEF polypeptide 

15 (e.g., a polypeptide which specifically modulates, e.g., by acting as either an 

agonist or antagonist, at least one biological activity of the DEF polypeptide). In 
a preferred embodiment, a DEF nucleic acid molecule includes the coding region 
of Figure 1, (SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 6, or SEQ ID NO: 9). 
Furthermore, in certain preferred embodiments, the subject DEF nucleic 

20 acids will include a transcriptional regulatory sequence, e.g., at least one of a 
transcriptional promoter or transcriptional enhancer sequence, which regulatory 
sequence is operably linked to the DEF gene sequences. Such regulatory 
sequences can be used to render the DEF gene sequences suitable for use as an 
expression vector. This invention also encompasses cells transfected with said 

25 expression vector whether prokaryotic or eukaryotic and a method for producing 
DEF proteins by employing the expression vectors. 

Accordingly, another aspect of the invention pertains to recombinant 
expression vectors containing the nucleic acid molecules of the invention and 
host cells into which such recombinant expression vectors have been introduced. 

30 In one embodiment, such a host cell is used to produce DEF polypeptide by 

culturing the host cell in a suitable medium. If desired, DEF polypeptide can be 
then isolated from the medium or the host cell. 

Still another aspect of the invention pertains to isolated DEF polypeptides 
and active fragments thereof, such as peptides having an activity of a DEF 

35 polypeptide (e.g., at least one biological acitivity of DEF polypeptide, such as the 
ability to bind to a src SH3 domain, the ability to induce PPARy expression; or 
the ability to induce the terminal differentiation of a cell, e.g., an adipose or a 
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neural precursor cell, e.g., a transformed adipose or a neural precursor cell). The 
invention also provides an isolated preparation of a DEF polypeptide. In 
preferred embodiments, the DEF polypeptide comprises an amino acid sequence 
of Figure 3, (SEQ ID NO: 2), or Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or 

5 SEQ ID NO: 10). In other embodiments, the isolated DEF polypeptide 
comprises an amino acid sequence at least 60 % identical to an amino acid 
sequence of Figure 3, (SEQ ID NO: 2) or Figure 12, (SEQ ID NO: 4, SEQ ID 
NO: 7, or SEQ ID NO: 10) and, preferably has an activity of DEF polypeptide 
(e.g., at least one biological activity of DEF polypeptide). Preferably, the protein 

10 is at least about 70 %, more preferably at least about 80 %, even more preferably 
at least about 90 % and most preferably at least about 95-99 % identical to the 
amino acid sequence of Figure 3, SEQ ID NO: 2 or Figure 12, SEQ ID NO: 4, 
SEQ ID NO: 7, or SEQ ID NO: 10. 

This invention also pertains to isolated polypeptides which include one or 

1 5 more of the following: a src SID consensus binding sequence having an amino 
acid sequence that is at least about 60% (preferably at least about 70%, 80%, 
90%, or 95-99%) identical to anamino acid sequence shown in Figure 3 (SEQ 
ID NO: 2) or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10), at 
least one ankyrin repeat, preferably two or three, and most preferably four 

20 ankyrin repeats, having an amino acid sequence that is at least 50% (preferably at 
least 60%, 70%, 80%, 90%, or 95-99%) identical to an amino acid sequence 
shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, 
or SEQ ID NO: 10), a zinc finger domain having an amino acid sequence that is 
at least about 50% (preferably at least about 60%, 70%, 80%, 90%, or 95-99%) 

25 identical to an amino acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 
12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10), a pleckstrin homology 
domain having an amino acid sequence that is at least about 50% (preferably at 
least about 60%, 70%, 80%, 90%, or 95-99%) identical to an amino acid 
sequence shown in Figure 3 (SEQ ID NO: 2; or Figure 12 (SEQ ID NO: 4, SEQ 

30 ID NO: 7, or SEQ ID NO: 10), a C2 domain having an amino acid sequence that 
is at least about 50% (preferably at least about 60%, 70%, 80%, 90%, or 95- 
99%) identical to an amino acid sequence shown in Figure 3 (SEQ ID NO: 2); or 
Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10), a proline-rich 
tandem repeat having an amino acid sequence that is at least about 50% 

35 (preferably at least about 60%, 70%, 80%, 90%, or 95-99%) identical to an 

amino acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID 
NO: 4), and an SH3 domain having an amino acid sequence that is at least about 
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50% (preferably at least about 60%, 70%, 80%, 90%, or 95-99%) identical to an 
amino acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID 
NO: 4 or SEQ ID NO: 7). 

The invention also provides for a DEF polypeptide comprising a first 
peptide having an amino acid sequence at least about 80% identical to an amino 
acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 1 2 (SEQ ID NO: 4, 
SEQ ID NO: 7, or SEQ ID NO: 10) and a second peptide corresponding to a 
moiety that facilitates detection or purification or alters the solubility of this 
fusion protein, such as glutathione-S-transferase, or an enzymatic activity such as 
alkaline phosphatase, or an epitope tag. 

Polypeptides comprising a polypeptide fragment of at least about 5, 10, 
15, 20, 25, 30, 40, 50, 60, 70, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 
600, 650, 700, 750, 800, 850-1 125 amino acid residues in length, preferably at 
least about 5-250 amino acid residues in length, and more preferably at least 
about 10-220 amino acid residues in length, and most preferably at least about 
200 amino acid residues corresponding to a protein having at least about 80% the 
amino acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID 
NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). In a preferred embodiment, the 
polypeptide fragment has a DEF activity, e.g., induces adipogenesis or 
neurogenesis. 

Still another aspect of the invention pertains to isolated DEF polypeptide 
and active fragments thereof, such as polypeptides having an activity of a DEF 
polypeptide (e.g., at least one biological acitivity of DEF, such as the ability to 
bind to an SH3 domain in an intra- or intermolecular interaction, a polypeptide 
capable of dimerizing to like molecules or other molecules, a polypeptide 
capable of anchoring cytoskeletal elements to the plasma membrane, a 
polypeptide capable of modulating the activity of signal transduction molecules, 
e.g., kinase activity, e.g., p38 MAP kinase activity, or G protein activity, e.g., 
GTPase activity, a polypeptide capable of inducing PPARy expression, a 
polypeptide capable of inducing the terminal differentiation of a 
hyperproliferative cell, e.g., a transformed cell, e.g., a transformed adipose cell, 
or a polypeptide capable of inducing adipogenesis or neurogenesis). 

The invention also provides an isolated preparation of a DEF protein. In 
a preferred embodiment, the isolated DEF protein comprises an amino acid 
sequence at least 70 % identical to an amino acid sequence of Figure 3 (SEQ ID 
NO: 2) or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10) and, 
preferably has an activity of DEF (e.g., at least one biological activity of DEF). 
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Preferably, the protein is at least about 80 %, more preferably at least about 90- 
95 %, even more preferably at least about 96-98 % and most preferably at least 
about 99 % identical to the amino acid sequence of Figure 3, (SEQ ID NO: 2) or 
Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). 
5 In another embodiment, the DEF protein comprises an amino acid 

sequence of Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 
7, or SEQ ID NO: 10). This invention also pertains to isolated polypeptides 
which include a src SH3 consensus binding sequence having an amino acid 
sequence that is at least 80%, preferably at least about 85%, more preferably at 

10 least about 86- 99% identical to a src SH3 consensus binding sequence shown in 
Figure 3, (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ 
ID NO: 10), at least one ankyrin repeat, preferably two or three, and most 
preferably four ankyrin repeats, having having an amino acid sequence that is at 
least about 80%, preferably at least about 85%, more preferably at least about 86- 

1 5 99% identical to an amino acid sequence shown in Figure 3 (SEQ ID NO: 2) or 
Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10), a zinc finger 
domain having an amino acid sequence that is at least about 80%, preferably at 
least about 85%, more preferably at least about 86- 99% identical to an amino 
acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4, 

20 SEQ ID NO: 7, or SEQ ID NO: 10), a pleckstrin homology domain having an 
amino acid sequence that is at least about 80%, preferably at least about 85%, 
more preferably at least about 86- 99% identical to an amino acid sequence 
shown in Figure 3 (SEQ ID NO: 2), or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 
7, or SEQ ID NO: 10), a C2 domain having an amino acid sequence that is at 

25 least about 80%, preferably at least 85%, more preferably at least about 86- 99% 
identical to an amino acid sequence shown in Figure 3 (SEQ ID NO: 2), or 
Figure 12, (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10), a proline-rich 
repeat having an amino acid sequence that is at least about 80%, preferably at 
least about 85%, more preferably at least about 86- 99% identical to an amino 

30 acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4), 
and an SH3 domain having an amino acid sequence that is at least about 80%, 
preferably at least about 85%, more preferably at least about 86- 99% identical to 
an amino acid sequence shown in Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ 
ID NO: 4 or SEQ ID NO: 7). 

35 The invention also provides for a DEF fusion protein comprising a first 

polypeptide having an amino acid sequence at least about 80% (preferably at 
least 90%, or 95-99%) identical to an amino acid sequence shown in Figure 3 
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(SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 
10) and a nucleotide sequence encoding a second polypeptide corresponding to a 
moiety that facilitates detection or purification or alters the solubility of the 
fusion protein, such as glutathione-S-transferase, or an enzymatic activity such as 
5 alkaline phosphatase, or an epitope tag. In preferred embodiments, the fusion 
protein comprises one or more of a src SH3 consensus binding sequence, an 
ankyrin repeat, a zinc finger domain, a PH domain, a C2 domain, a proline-rich 
repeat, or an SH3 domain of a DEF polypeptide. 

Yet another aspect of the present invention features an immunogen 

1 0 comprising a DEF polypeptide in an immunogenic preparation, the immunogen 
being capable of eliciting an immune response specific for a DEF polypeptide; 
e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In 
preferred embodiments, the immunogen includes an antigenic determinant, e.g. a 
unique determinant, from a protein having at least about 80%, preferably at least 

15 about 85%, more preferably at least about 87-99% identity with the amino acid 
sequence represented by one of Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID 
NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). 

A still further aspect of the present invention features antibodies and 
antibody preparations specifically reactive with an epitope of the DEF 

20 immunogen. 

The invention also features transgenic non-human animals, e.g. mice, 
rats, rabbits, chickens, frogs or pigs, having a transgene, e.g., animals which 
include (and preferably express) a heterologous form of a DEF gene described 
herein, or which misexpress an endogenous DEF gene, e.g., an animal in which 

25 expression of one or more of the subject DEF proteins is disrupted. Such a 

transgenic animal can serve as an animal model for studying cellular and tissue 
disorders comprising mutated or mis-expressed DEF alleles or for use in drug 
screening. 

The invention also provides probes and primers composed of 
30 substantially purified oligonucleotides, which correspond to a region of 
nucleotide sequence which hybridizes to at least 6 consecutive nucleotides 
preferably at least 25 more preferably at least 40, 50 or at least 75 consecutive 
nucleotides of either sense or antisense sequences of Figure 2 (SEQ ID NO:l), 
Figure 1 3 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ 
35 ID NO: 8), or Figure 1 5 (SEQ ID NO: 9 or SEQ ID NO: 1 1 ) or naturally 

occurring mutants thereof. In preferred embodiments, an oligonucleotide of the 
present invention specifically detects a DEF nucleic acid relative to other nucleic 
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acid in a sample. In yet another embodiment, the probe/primer further includes a 
label which is capable of being detected. The label group can be selected, e.g., 
from a group consisting of radioisotopes, fluorescent compounds, enzymes, and 
enzyme co-factors. Probes of the invention can be used as a part of a diagnostic 
5 test kit for identifying dysfunctions associated with mis-expression of a DEF 
protein, such as for detecting in a sample of cells isolated from a patient, a level 
of a nucleic acid encoding a DEF protein; e.g. measuring a DEF mRNA level in 
a cell, or determining whether a genomic DEF gene has been mutated or deleted. 
These so-called "probes/primers" of the invention can also be used as a part of 

1 0 "antisense" therapy which refers to administration or in situ generation of 
oligonucleotide probes or their derivatives which specifically hybridize (e.g. 
bind) under cellular conditions, with the cellular mRNA and/or genomic DNA 
encoding one or more of the subject DEF proteins so as to inhibit expression of 
that protein, e.g. by inhibiting transcription and/or translation. Preferably, the 

15 oligonucleotide is at least 12 nucleotides in length, although primers of 25, 40, 
50, or 75 nucleotides in length are also encompassed. 

Yet another aspect of the present invention concerns a method for 
modulating one or more of a cell by modulating a DEF biological activity, e.g., 
by potentiating or disrupting certain protein-protein interactions. In general, 

20 whether carried out in vivo, in vitro, or in situ, the method includes treating the 
cell with an effective amount of DEF or a DEF agent so as to alter, relative to the 
cell in the absence of treatment, at least one or more of (i) cellular gene 
expression, (ii) cell proliferation, (iii) cell differentiation, e.g., differentiation of 
adipose or neural precursor cells, (iv) signal transduction, (v) cytoskeletal 

25 architecture, (vi) protein trafficking, (vii) adhesion of a cell. Accordingly, the 
method can be carried out with DEF or a DEF agents such as peptide and 
peptidomimetics or other molecules identified in the drug screens devised herein 
which agonize or antagonize the effects of signaling from a DEF protein or 
ligand binding of a DEF protein, e.g., an intracellular target molecule, e.g., an 

30 SH3 domain-containing molecule, a G protein, e.g., GTPase protein, or a 
cytoskeleton molecule. Other DEF agents include antisense constructs for 
inhibiting expression of DEF proteins, and different domains of the DEF proteins 
that may act as dominant negative mutants of DEF proteins which competitively 
inhibit ligand interactions upstream and signal transduction downstream of a 

35 DEF protein. 
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In one embodiment, the subject method of modulating a DEF biological 
activity can be used in the treatment of hyperproliferative cell to modulate 
growth arrest and terminal differentiation of a cell. In a preferred embodiment, 
the modulation of DEF activity occurs in an adipocyte or neural cell, in order to 
5 modulate adipocyte or neuronal differentiation. In another embodiment, the 
subject method is used to modulate induce growth arrest and differentiation of a 
cancer cell. 

In yet another aspect, the invention provides a drug screening assay for 
screening test compounds for modulators, e.g., inhibitors, or alternatively, 

10 potentiators, of an interaction between an SH3 domain-containing protein, e.g., a 
DEF molecule or a c-src protein tyrosine kinases, e.g., pp60 c ~* yrc and a DEF 
polypeptide or a biologically active portion thereof, e.g., an SH3 binding domain. 
An exemplary method includes the following (a) forming a reaction mixture 
including: (i) a pp60 c * 5rc , (ii) a DEF or an SID binding domain, and (iii) a test 

15 compound; and (b) detecting interaction of the pp60 c ~ 5rc and DEF or an SH3 
binding domain. A statistically significant change (potentiation or inhibition) in 
the interaction of the ppdO^-^and DEF or an SH3 binding domain in the 
presence of the test compound, relative to the interaction in the absence of the 
test compound, indicates a potential agonist (mimetic or potentiator) or 

20 antagonist (inhibitor) of said interaction. The reaction mixture can be a cell-free 
protein preparation, e.g., a reconsituted protein mixture or a cell lysate, or it can 
be a recombinant cell including a heterologous nucleic acid recombinantly 
expressing the DEF polypeptide. 

In another embodiment, an assay is provided for screening for modulators 

25 of an interaction between a DEF polypeptide or biologically active portions 
thereof, e.g., a src SH3 consensus binding sequence, an ankyrin repeat, a zinc 
finger domain, a PH domain, a C2 domain, a proline-rich repeat and an SH3 
domain, with signaling molecules. As an illustrative embodiment, test 
compounds that modulate the interaction between a DEF polypeptide or an 

30 ankyrin repeat and a cytoskeletal molecule can be tested. 

In preferred embodiments, the steps of the assay are repeated for a 
variegated library of at least 100 different test compounds, more preferably at 
least 10 3 , 10 4 or 1 0 5 different test compounds. The test compound can be, e.g., a 
peptide, a nucleic acid, a small organic molecule, or natural product extract (or 

35 fraction thereof). 
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Another aspect of the present invention provides a method of determining 
if a subject, e.g. an animal patient, is at risk for a disorder characterized by 
unwanted biological activity of a DEF polypeptide. The method includes 
detecting, in a tissue of the subject, the presence or absence of a genetic lesion 
5 characterized by at least one of (i) a mutation of a gene encoding a DEF protein; 
or (ii) the mis-expression of a DEF gene. In preferred embodiments, detecting 
the genetic lesion includes ascertaining the existence of at least one of: a deletion 
of one or more nucleotides from a DEF gene; an addition of one or more 
nucleotides to the gene, a substitution of one or more nucleotides of the gene, a 

10 gross chromosomal rearrangement of the gene; an alteration in the level of a 
messenger RNA transcript of the gene; the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of the gene; a non-wild type level of the 
protein; and/or an aberrant level of soluble DEF protein. 

For example, detecting the genetic lesion can include (i) providing a 

1 5 probe/primer including an oligonucleotide containing a region of nucleotide 
sequence which hybridizes to a sense or antisense sequence of a DEF gene or 
naturally occurring mutants thereof, or 5* or 3 1 flanking sequences naturally 
associated with the DEF gene; (ii) exposing the probe/primer to nucleic acid of 
the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic 

20 acid, the presence or absence of the genetic lesion; e.g. wherein detecting the 

lesion comprises utilizing the probe/primer to determine the nucleotide sequence 
of the DEF gene and, optionally, of the flanking nucleic acid sequences. For 
instance, the probe/primer can be employed in a polymerase chain reaction 
(PCR) or in a ligation chain reaction (LCR). In alternate embodiments, the level 

25 of a DEF protein is detected in an immunoassay using an antibody which is 
specifically immunoreactive with the DEF protein. 

Another aspect of the invention provides a method for inhibiting 
proliferation of a hyperproliferative cell, e.g., a neoplastic cell, comprising 
ectopically expressing DEF or a functional fragment thereof in a cell in order to 

30 induce differentiation of the cell. In one embodiment, ectopic expression of DEF 
in a precursor cell may result in the differentiation of a hyperproliferative cell, 
e.g., an adipocyte precursor cell, or a cells derived from an adipose tumor, e.g., 
lipomas, fibrolipomas, lipoblastomas, lipomatosis, hibernomas, hemangiomas 
and/or liposarcomas, into adipocytes. In other embodiments, activation of DEF 

35 may synergize with other signaling agents to augment the differentiated 

phenotype. Thus, DEF alone or in combination with other agents can be used for 
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the treatment of, or prevention of a disorder characterized by aberrant cell 
growth. 

For example, the subject method can be used in the treatment of disorders 
mediated by an aberrant activity of a PPARy receptor. The subject method can 
5 be used in treating disorders characterized by the aberrant activity of an 
adipocyte precursor cell, e.g., obesity. 

As another example, the subject method can be used in the treatment of 
sarcomas, carcinomas and/or leukemias. Exemplary disorders for which the 
subject method may be used as part of a treatment regimen include: 

10 fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, 
chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, 
lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor, 
leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast 
cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, basal cell 

15 carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, 
papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct 
carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms 1 tumor, 
cervical cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, 

20 bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, 
craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic 
neuroma, oligodendroglioma, meningioma, melanoma, neuroblastoma, and 
retinoblastoma. 

In certain embodiments, the subject method can be used to treat such 
25 disorders as carcinomas forming from tissue of the breast, prostate, kidney, 
bladder or colon. 

In other embodiments, the subject method can be used to treat 
hyperplastic or neoplastic disorders arising in adipose tissue, such as adipose cell 
tumors, e.g., lipomas, fibrolipomas, lipoblastomas, lipomatosis, hibernomas, 
30 hemangiomas and/or liposarcomas. 

In still other embodiments, the subject method can be used to treat 
hyperplastic or neoplastic disorders of the hematopoietic system, e.g., leukemic 
cancers. In a preferred embodiment, the subject is a mammal, e.g., a primate, 
e.g., a human. 

35 The practice of the present invention will employ, unless otherwise 

indicated, conventional techniques of cell biology, cell culture, molecular 
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biology, transgenic biology, microbiology, recombinant DNA, and immunology, 
which are within the skill of the art. Such techniques axe explained fully in the 
literature. See, for example, Molecular Cloning A Laboratory Manual, 2nd Ed., 
ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 

, 5 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); 

Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 
4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); 
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture 
Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells 

1 0 And Enzymes (IRL Press, 1 986); B. Perbal, A Practical Guide To Molecular 
Cloning (1984); the treatise, Methods In Enzymology (Academic Press, Inc., 
N. Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. 
Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, 
Vols. 154 and 155 (Wu et al. eds.), Immunochemical Methods In Cell And 

1 5 Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1 987); 
Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. 
Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

Other features and advantages of the invention will be apparent from the 

20 following detailed description, and from the claims. 

Brief Description of the Drawings 

Figures 1 A-1B are silver-stained gels depicting the SDS/PAGE 
electrophoretic resolution of bovine DEF-1 protein. Figure 1 A shows 

25 SDS/PAGE analysis of src SH3 binding proteins by passing bovine brain lysates 
over src SH3 and src SH3SH2 affinity columns. Figure IB depicts further 
analysis of proteins which bound to src SH3 and src SH3SH2 affinity columns 
by passing eluted proteins over an ATP agarose column. Molecular size markers 
in kilodaltons are indicated on the left side. 

30 Figure 2 is the full-length nucleotide sequence of the bovine DEF-1 gene 

(coding and untranslated regions; SEQ ID NO: 1). 

Figure 3 is the predicted amino acid sequence of bovine DEF-1 (SEQ ID 
NO: 2). The number of the last amino acid in a line is noted on the right. The 
following domains were identified: pleckstrin homology domain corresponding 

35 to amino acids 326-419; zinc finger domain 457-480; C2 domain corresponding 
to amino acids 498-557; ankyrin-related motifs corresponding to amino acids 
356-374, 604-623, 640-659 and 672-692; SH3 consensus binding sequence 
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corresponding to amino acids 794-799, 803-809, 829-835, 895-901 and 993-999; 
proline-rich repeat corresponding to amino acids 934-1001 ; and SH3 domain 
corresponding to amino acids 1073-1 123. Key: overline = peptide sequenced; 
and underline = putative alternative exon. 
5 Figure 4 is a schematic representation of the structure of bovine DEF- 1 . 

Figure 5 is a Western blot depicting the association of bovine DEF-1 with 
src SH3 by passing lysates made from bovine brain (brain extract) or insect cells 
infected with baculovirus pp60c-src (gv src) over afinity columns containing two 
glutathione S-transferase (GST) fusion proteins spanning regions of bovine DEF- 
10 1 . The fusion proteins were: GST-src binding domain (GST-DEF-1 amino acids 
777-926) and GST-C terminal of DEF-1 (GST-C 928-1 129) as indicated. Bound 
proteins were resolved by SDS-PAGE electrophoresis and detected using an anti- 
src antibody. 

Figure 6A is an alignment of the amino acid sequences of various SH3 
15 domains found in c-src, c-fgr, c-fyn, c-abl, p85 and grb-2N. Highly conserved 
residues that are presumably in direct contacts with SH3-binding sites are 
indicated. 

Figure 6B is a schematic representation of the interaction of a src SH3 
consensus binding sequence adopting a polyproline type II helix conformation 

20 and an SH3 domain. Figures 7A and 7B are schematic representations of the 
putative left-handed polyproline type II helix configuration of bovine DEF-1 
proline-rich motifs (amino acids 934-1001). Figure 7A represents the putative 
structure of repeats 1-3 (amino acids 934-974). Figure 7B represents the putative 
structure of repeats 3-6 (amino acids 966-1001). Circles represent the amino 

25 acid indicated with a single letter code. 

Figure 8 is an alignment of the amino acid sequences in the SH3 domain 
of bovine DEF-1 with its SH3 binding site. Represented in between the SH3 
domains is an alignment of the proline-rich repeats in a homodimer 
configuration, the Interacting basic and acidic residues are indicated by squares 

30 and circles, respectively. 

Figures 9A is an alignment of the amino acid sequences of the C2 domain 
(amino acids 498-557) of bovine DEF-1 (DEF zinc) with other C2 containing 
proteins. 

Figure 9B is an alignment of the amino acid sequences of the C2 domain 
35 (amino acids 498-557) of bovine DEF-1 (DEF zinc) with other C2 containing 
proteins that also contain a zinc finger domain. 
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Figure 10 is a bar graph summarizing the enhanced level of adipocytic 
differentiation in control PPARy-expressing Balb/3T3 cells (left, solid bar) 
compared to Balb/3T3 cells co-expressing PPARy and DEF-1 (right, spleckled 
bar) in the presence of the indicated concentrations of pioglitazone (pio). 
5 Figure 1 1 is a schematic representation of deletion mutants of bovine 

DEF-1. DEF-1 /Apa mutants (amino acids 1-800) and DEF-l/Bgl mutants (last 
200 amino acids of bovine DEF-1) are indicated. 

Figure 12 is an alignment of the amino acid sequences of DEF family 
members. Amino acid sequences corresponding to bovine DEF-1 (SEQ ID NO: 
10 2); zebrafish DEF-1 (SEQ ID NO: 4); zebrafish DEF-2 (SEQ ID NO: 7); 
zebrafish DEF-3 (SEQ ID NO: 10); and human DEF-2 (SEQ ID NO: 12) are 
indicated. 

Figure 13 is the full-length nucleotide sequence of the zebrafish DEF-1 
gene (coding and untranslated regions; SEQ ID NO: 3). 
15 Figure 14 is the full-length nucleotide sequence of the zebrafish DEF-2 

gene (coding and untranslated regions; SEQ ID NO: 6). 

Figure 15 is the full-length nucleotide sequence of the zebrafish DEF-3 
gene (coding and untranslated regions; SEQ ID NO: 9). 

Figure 16 is a schematic representation of zebrafish DEF family structure. 

20 

Detailed Description of the Invention 

The present invention is based on the discovery of novel molecules, 
referred to herein as "differentiation enhancing factors" or DEF protein and 
nucleic acid molecules, which comprise a family of molecules having certain 

25 conserved structural and functional features. The term "family" when referring 
to the protein and nucleic acid molecules of the invention is intended to mean 
two or more proteins or nucleic acid molecules having a common structural 
domain and having sufficient amino acid or nucleotide sequence homology as 
defined herein. Such family members can be naturally occurring and can be 

30 from either the same or different species. For example, a family can contain a 
first protein of human origin, as well as other, distinct proteins of human origin 
or alternatively, can contain homologues of non-human origin. Members of a 
family may also have common functional characteristics. 

One aspect of the invention pertains to nucleic acids encoding DEF 

35 family members and DEF polypeptides. Preferably, a DEF family member 
includes at least one SH3 consensus binding sequence, at least one, preferably 
four ankyrin repeats, at least one zinc finger domain, at least one pleckstrin 
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homology domain and at least one C2 domain. In another embodiment, a DEF 
family member has at least one or more of the above-identified domains and has 
an amino acid sequence which is at least about 40% identical to an amino acid 
sequence shown in Figure 2 (SEQ ID NO:2). 
5 In yet another embodiment, a DEF family member has an amino acid 

sequence, which is at least about 40% identical to an amino acid sequence shown 
in Figure 2 (SEQ ID NO:2). 

In another embodiment, a DEF family member has one or more of the 
above-identified domains and is encoded by a nucleic acid which encodes an 

10 amino acid sequence which is at least about 40% identical to an amino acid 
sequence shown in Figure 2 (SEQ ID NO:2). 

In still another embodiment, a DEF family member is encoded by a 
nucleic acid which encodes an amino acid sequence which is at least about 40% 
identical to an amino acid sequence shown in Figure 2 (SEQ ID NO:2). 

15 In still another embodiment, a DEF family member has at least one 

biological activity of a DEF polypeptide, such as the ability to bind to an SH3 
domain in an intra- or intermolecular interaction, a polypeptide capable of 
dimerizing to like molecules or other molecules, a polypeptide capable of 
anchoring cytoskeletal elements to the plasma membrane, a polypeptide capable 

20 of modulating the activity of signal transduction molecules, e.g., kinase activity, 
e.g., p38 MAP kinase activity, or G protein activity, e.g., GTPase activity, a 
polypeptide capable of inducing PPARy expression, a polypeptide capable of 
inducing the terminal differentiation of a hyperproliferative cell, e.g., a 
transformed cell, e.g., a transformed adipose cell, or a polypeptide capable of 

25 inducing adipogenesis or neurogenesis). In yet another embodiment, a DEF 
family member: (i) has one or more of the above-identified domains, (ii) is 
encoded by a nucleic acid which encodes polypeptide having an amino acid 
sequence, which is at least about 40% identical to an amino acid sequence shown 
in Figure 2 (SEQ ID NO:2), (iii) is a polypeptide having an amino acid sequence, 

30 which is at least about 40% identical to an amino acid sequence shown in Figure 
2 (SEQ ID NO:2), and (iii) has at least one biological activity of a DEF 
polypeptide. 

In another aspect, the invention features nucleic acids encoding a DEF-1 
polypeptide, as well as DEF-1 polypeptides. Such DEF- 1 nucleic acids and 
35 polypeptides have at least one SH3 consensus binding sequence, at least one, 
preferably four ankyrin repeats, at least one zinc finger domain, at least one 
pleckstrin homology domain, at least one C2 domain, at least one proline-rich 
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repeat, and at least one SH3 domain. In one embodiment, the DEF-1 polypeptide 
has the above-identified domains and is encoded by a nucleic acid which is at 
least about 60% (preferably at least about 61-65%, 70%, 80%, 90% or 95-99%) 
identical to the nucleotide sequence of Figure 2 (SEQ ID NO: 1) or Figure 13 
5 (SEQ ID NO: 3 or SEQ ID NO: 5). In another embodiment, the DEF-1 

polypeptide is encoded by a nucleic acid which is at least about 60% (preferably 
at least about 61-65%, 70%, 80%, 90% or 95-99%) identical to the nucleotide 
sequence of Figure 2 (SEQ ID NO: 1) or Figure 13 (SEQ ID NO: 3 or SEQ ID 
NO: 5). 

10 In other embodiments, the DEF-1 polypeptide has the above-identified 

domains and has an amino acid sequence which is at least about 60% (preferably 
at least about 70%, 71-74%, 75%, 80%, 90% or 95-99%) identical to the amino 
acid sequence of Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4). In 
other embodiments, the DEF-1 polypeptide has an amino acid sequence which is 

15 at least about 60% (preferably at least about 70%, 71-74%, 75%, 80%, 90% or 
95-99%) identical to the amino acid sequence of Figure 3 (SEQ ID NO: 2) or 
Figure 12 (SEQ ID NO: 4). In still another embodiment, the DEF-1 polypeptide 
has at least one biological activity of a DEF polypeptide. In yet another 
embodiment, the DEF-1 polypeptide: (i) has one or more of the above-identified 

20 domains, (ii) is encoded by the above-described nucleic acids, (iii) has the above- 
described amino acid sequence, and (iv) has at least one biological activity of a 
DEF polypeptide. 

In one embodiment, the DEF-1 polypeptide is a protein of a calculated 
molecular weight of approximately 120-130 kDa, and preferably 125 kDa 

25 consisting of approximately 1 129 amino acids and having the amino acid 
sequence shown in Figures 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4). 
Each DEF polypeptide consists of an amino terminal portion of about 350 amino 
acids (about amino acids 1 -350 of the sequence shown in Figure 2 (SEQ ID NO: 
2) or Figure 12 (SEQ ID NO: 4)) followed by four ankyrin repeats (each of about 

30 20 amino acids in length), at least one SH3 binding site (each of about 1 0 amino 
acids), a proline-rich repeat of about 68 amino acids, a PH domain, a C2 domain 
of about 60 amino acids and an SH3 domain of about 50 amino acids. 

In another embodiment, the DEF-1 polypeptide includes a C-terminal 
domain of the molecule. As used herein, a "C-terminal domain" is a polypeptide 

35 of about 100-300 amino acids, more preferably, about 150-250 amino acids, and 
most preferably 200 amino acids which includes at least one proline-rich repeat 
and at least one SH3 domain. Preferably, the C-terminal domain of DEF-1 has at 
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least one of the above-identified domains and has an amino acid sequence which 
is at least about 60% (preferably at least about 70%, 71-74%, 75%, 80%, 90% or 
95-99%) identical to the amino acid sequence of the last 200 amino acids of 
Figure 3 (SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4). In another embodiment, 
5 the C-terminal domain of DEF-1 has an amino acid sequence which is at least 
about 60% (preferably at least about 70%, 71-74%, 75%, 80%, 90% or 95-99%) 
identical to the amino acid sequence of the last 200 amino acids of Figure 3 
(SEQ ID NO: 2) or Figure 12 (SEQ ID NO: 4). In still another embodiment, the 
C-terminal domain of DEF-1 has at least one biological activity of a DEF 

10 polypeptide, e.g., induces adipogenesis. In yet another embodiment, the C- 

terminal domain of DEF-1 : (i) has one or more of the above-identified domains, 
(ii) the above-described amino acid sequence, and (iii) at least one biological 
activity of a DEF polypeptide. 

In yet another aspect, the invention features nucleic acids encoding a 

15 DEF-2 polypeptide, as well as DEF-2 polypeptides. Such DEF-2 nucleic acids 
and polypeptides have at least one SIB consensus binding sequence, at least one, 
preferably four ankyrin repeats, at least one zinc finger domain, at least one 
pleckstrin homology domain, at least one C2 domain, and at least one SH3 
domain. In one embodiment, the DEF-2 polypeptide has the above-identified 

20 domains and is encoded by a nucleic acid which is at least about 60% (preferably 
at least about 61-65%, 70%, 80%, 90% or 95-99%) identical to the nucleotide 
sequence of Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8). In another 
embodiment, the DEF-2 polypeptide is encoded by a nucleotide which is at least 
about 60% (preferably at least about 61-65%, 70%, 80%, 90% or 95-99%) 

25 identical to the nucleotide sequence of Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 
8). 

In other embodiments, the DEF-2 polypeptide has one or more the above- 
identified domains and has an amino acid sequence which is at least about 70 % 
(preferably at least about 71-74%, 75%, 80%, 90% or 95-99%) identical to the 

30 amino acid sequence of Figure 12 (SEQ ID NO: 7). In other embodiments, the 
DEF-2 polypeptide has an amino acid sequence which is at least about 60% 
(preferably at least about 70%, 71-74%, 75%, 80%, 90% or 95-99%) identical to 
the amino acid sequence of Figure 12 (SEQ ID NO: 7). In still another 
embodiment, the DEF-2 polypeptide has at least one biological activity of a DEF 

35 polypeptide. In yet another embodiment, the DEF-2 polypeptide: (i) has one or 
more the above-identified domains, (ii) is encoded by the above-described 
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nucleic acids, (iii) has the above-described amino acid sequence, and (iv) at least 
one biological activity of a DEF polypeptide. 

In yet another aspect, the invention features nucleic acids encoding a DEF-3 
polypeptide, and DEF-3 polypeptides. Such DEF-3 nucleic acid and polypeptide have at 
5 least one SH3 consensus binding sequence, at least one, preferably four ankyrin repeat, 
at least one zinc finger domain, at least one pleckstrin homology domain, and at least 
one C2 domain. In one embodiment, the DEF-3 polypeptide has the above-identified 
domains and is encoded by a nucleic acid which is at least about 60% (preferably at least 
about 61-65%, 70%, 80%, 90% or 95-99%) identical to the nucleotide sequence of 

10 Figure 1 5 (SEQ ID NO: 9 or SEQ ID NO: 1 1). In another embodiment, the DEF-3 

polypeptide is encoded by a nucleic acid which is at least about 60% (preferably at least 
about 61-65%, 70%, 80%, 90% or 95-99%) identical to the nucleotide sequence of 
Figure 1 5 (SEQ ID NO: 9 or SEQ ID NO: 1 1). 

In other embodiments, the DEF-3 polypeptide has the above-identified domains 

15 and has an amino acid sequence which is at least about 70 % (preferably at least about 
71-74%, 75%, 80%, 90% or 95-99%) identical to the amino acid sequence of Figure 12 
(SEQ ID NO: 10). In other embodiments, the DEF-3 polypeptide has an amino acid 
sequence which is at least about 60% (preferably at least about 70%, 71-74%, 75%, 
80%, 90% or 95-99%) identical to the amino acid sequence of Figure 12 (SEQ ID NO: 

20 10). In still another embodiment, the DEF-3 polypeptide has at least one biological 
activity of a DEF polypeptide. In yet another embodiment, the DEF-3 polypeptide: (i) 
has one or more of the above-identified domains, (ii) is encoded by the above-described 
nucleic acids, (iii) has the above-described amino acid sequence, and (iv) at least one 
biological activity of a DEF polypeptide. 

25 In one embodiment, DEF polypeptides' include a src SH3 consensus 

binding sequence. As used herein, the language "src SH3 consensus binding 
sequence" is intended to include class I and, preferably, class II peptides which 
associate with an SH3 domain. The peptide ligand therefore has three spines, 
two contacting the SH3 domain, and the third stabilizing the PPII helix. The 

30 core ligand is a seven residue peptide containing the consensus X-P-p-X-P, 

where X is an aliphatic residue and the two conserved prolines (P) are necessary 
for high affinity binding. The intervening scaffolding residue (p) also tends to be 
a proline. Each X-P pair fits into a hydrophobic pocket formed by conserved 
SIB aromatic residues (sites 1 and 2), providing the principal binding energy. A 

35 third pocket (site 3) is more variable, although it frequently binds an arginine. 
Residues adjacent to the prolines also form contacts within the SH3 sequence 
and these interactions determine the specificity between a protein and a particular 
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SH3. For example, the arginine in "RPLPXXP" forms a salt bridge with 
aspartate 99 of pp60 c_jrc . However the C-terminal arginine in the sequence 
"AFAPPLPRR" contacts the identical aspartate in pp60°' src . This term is 
intended to encompass proteins that interact with with SH3 domains in either a 
5 "plus" or "minus" orientation (named "class I" and "class II" binding, 
respectively; Yu et al. (1994) Cell 76:933-945; Lim et al. (1994) Nature 
372:375-379). In one embodiment, the src SH3 consensus binding sequence has 
an amino acid sequence of up to 10 amino acids, preferably about 4-8 amino 
acids, most preferably about 6 amino acids and contains about amino acids 794- 

10 799, 803-809, 829-835, 895-901 or 993-999 of Figure 3 (SEQ ID NO: 2), amino 
acids 827-833, 892-898 or 1005-101 1 of Figure 12 (SEQ ID NO: 4), amino acids 
777-782 or 822-828 of Figure 12 (SEQ ID NO: 7), and amino acids 780-785, 
829-834, 834-840 or 867-873 of Figure 12 (SEQ ID NO: 10). 

In yet another embodiment, the DEF polypeptides include at least one 

15 motif having proline-rich stretch located between the SH3 domain and the 

predicted SH3 binding sites in DEF-1 . This region can be subdivided into six 
tandem repeats centered on the consensus sequence "GDLPPKP". The number 
of prolines in this repeat suggests that this region forms a left-handed polyproline 
type II helix (Williamson, M.P. (1994) Biochemical Journal 297:249-60). 

20 Accordingly, the four C-terminal repeats form a trigonal prism with an acidic 
"edge", a basic edge, and an uncharged edge (Figures 7A-7B). In one 
embodiment, the proline-rich repeat has an amino acid sequence of up to 75 
amino acids, preferably about 50-70 amino acids, most preferably about 65 
amino acids and contains about amino acids 934-1001 of Figure 3 (SEQ ID NO: 

25 2), or amino acids 944-1013 of Figure 12 (SEQ ID NO: 4). 

In still another embodiment, the DEF polypeptides include at least one 
motif having homology to an ankyrin repeat The term "ankyrin repeat" refers to 
an amino acid motif, preferably about 33 amino acids in length, which is 
typically repeated several times in an amino acid sequence, e.g., a motif repeated 

30 24 times in the protein ankyrin, and which is believed to be involved in directing 
the protein to the inner face of the plasma membrane (Hatada et al., 1992 Proc. 
Natl Acad. Sci. USA 89, 2489-2493; Michaely and Bennett, 1993; Lambert and 
Bennett, 1993). Ankyrin repeats have been found in several other proteins such 
as the transcription factor regulator, Ik-B (Hay, 1993), and the protooncogene 

35 Bcl-3 (Ohno et al., 1990 Gell 60: 991). In one embodiment, the ankyrin repeat 
sequence has an amino acid sequence of up to 25 amino acids, preferably about 
10-20 amino acids, most preferably about 18-19 amino acids and contains about 
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amino acids 356-374, 604-623, 640-659 or 672-692 of Figure 3 (SEQ ID NO: 2), 
amino acids 353-371, 601-620, 637-656, or 669-689 of Figure 12 (SEQ ID NO: 
4), amino acids 334-352, 585-604, 621-640, or 653-673 of Figure 12 (SEQ ID 
NO: 7), and amino acids 334-352, 584-603, 620-639, or 652-672 of Figure 12 
5 (SEQ ID NO: 10). 

In yet another embodiment, the DEF polypeptides include a pleckstrin 
homology (PH) domain. As used herein, a PH domain is a protein module of 
approximately 100 amino acids typically located at the carboxy-terminal of 
proteins involved in signal transduction processes (See also Haslam et al. (1993) 

10 supra; Mayer et al. (1993) supra; Musacchio et al. (1993) 775528:343-348). 

Typically, PH domains are very divergent and do not occupy a specific positions 
in molecules; alignments of PH domains show six conserved blocks, each 
containing several conserved hydrophobic residues which to form a folded 
structure comprising seven to eight p-strands, most likely in one or two p-sheets, 

1 5 and a single helix (Musacchio et al. supra). PH domains have been identified in 
kinases and also in Vav, Dbl, Bcr, yeast cdc24, Ras-GAP, DM GAP, Ras-GRF, 
Sos PH, protein kinase C a, phospholipase C-51 (Burgering, B.M.T. and P.J. 
Coffer (1995) supra; Franke et al. (1995) supra; Coffer, P J. and J.R. Woodgett 
(1991) supra), the serine/threonine kinase known variously as protein kinase B, 

20 Akt and Rac among others. The PH domain of P adrenergic receptor kinase may 
be involved in binding to G protein py complexes (Koch et al. (1993) J. Biol. 
Chem. 268:8256-8260). PH domains have been implicated in the binding to 
membranes containing PI 4,5-bisphosphate (Lemmon et al. (1995) supra), as 
well as to the binding of several proteins Py subunits (GPy) of heterotrimeric G 

25 proteins (Touhara et al. (1994) supra; Satoshi et al. (1994) supra; Lemmon et al. 
(1995) supra), protein kinase C (17), WD motifs (18). In addition, the isolated 
PH domain of PLCgl has been shown to specifically interact with high affinity 
with PI-4,5 P2 and D-myo-inositol 1,4,5 trisphosphate (Ins(l,4,5) P3) (Lemmon 
et al. (1995) supra). In one embodiment, the PH sequence has an amino acid 

30 sequence of up to 1 50 amino acids, preferably about 80-120 amino acids, most 
preferably about 100 amino acids and contains about amino acids 326-419 of 
Figure 3 (SEQ ID NO: 2), amino acids 323-416 of Figure 12 (SEQ ID NO: 4), 
amino acids 304-397 of Figure 12 (SEQ ID NO: 7), or amino acids 303-397 of 
Figure 12 (SEQ ID NO: 10). 

35 In another embodiment, the DEF polypeptides include a zinc finger 

domain. As used herein the term "ziftger finger domain" refers to a structural 
motif present in a family of transcription factors. An illustration of this class are 
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members of the GATA family of zinc finger-containing transcription factors, 
e.g., GATA-1 (Trainor, CD. et al. Nature 343:92-96(1990). Examples of 
eukaryotic proteins having a similar zinc finger motif include GCS 1 (Ireland et 
al., 1994), ROKa and ARF1GAP (Leung et al., 1995; Cukierman et al., 1995). 
5 This term is also intended to include motiffs that interact with G proteins and 
affect GTPase activity. In one embodiment, the zinger finger domain has an 
amino acid sequence of up to about 35 amino acids, preferably about 20-30 
amino acids, most preferably about 25 amino acids and contains about amino 
acids 457-480 of Figure 3 (SEQ ID NO: 2), amino acids 454-477 of Figure 12 

10 (SEQ ID NO: 4), amino acids 436-459 of Figure 12 (SEQ ID NO: 7), or amino 
acids 436-459 of Figure 12 (SEQ ID NO: 10). 

As used herein the language "SH3 domain" refers to a domain of 
approximately 60 amino acids in length named Src homology 3 which has been 
identified in numerous signal transduction proteins (Pawson, T. and J. 

15 Schlessinger (1993) J. Curr. Bio. 3:434-442; Courtneidge et al. (1994) Trends 
Cell Biol. 4:345-347; Pawson, T. (1 995) Nature 373: 573-580). These domains 
interact with other signal transduction proteins. In one embodiment, the SH3 
domain has an amino acid sequence of up to about 100 amino acids, preferably 
about 40-80 amino acids, most preferably about 60 amino acids and contains 

20 about amino acids 1073-1 123 of Figure 3 (SEQ ID NO: 2), amino acids 1095- 
1 145 of Figure 12 (SEQ ID NO: 4), or amino acids 926-976 of Figure 12 (SEQ 
ID NO: 7). 

As used herein the language "C2 domain" is intended to include a domain 
believed to be involved in lipid binding, primarily phosphatidylinositol binding. 

25 In one embodiment, the C2 domain has an amino acid sequence of up to about 70 
amino acids, preferably about 50-65 amino acids, most preferably about 60 
amino acids and contains about amino acids 498-557 of Figure 3 (SEQ ID NO: 
2), amino acids 495-554 of Figure 12 (SEQ ID NO: 4), amino acids 477-537 of 
Figure 12 (SEQ ID NO: 7), or amino acids 477-536 of Figure 12 (SEQ ID NO: 

30 10). 

In another embodiment, a portion of a DEF protein, e.g., a src SH3 
binding sequence, may antagonize the biological/biochemical activities of a 
naturally occurring DEF protein by acting as a dominant negative regulator of a 
DEF protein or a fragment therof. In another embodiment, a portion of a DEF 
35 protein, e.g., a zinc finger domain, may activate the biological/biochemical 
activities of a naturally occurring DEF protein. 
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Other aspects of the present invention relate to nucleic acids encoding 
DEF polypeptides, the DEF polypeptides themselves (including various 
fragments containing domains), antibodies immunoreactive with DEF proteins, 
and preparations of such compositions. Moreover, the present invention 
5 provides diagnostic and therapeutic assays and reagents for detecting and treating 
disorders involving, for example, aberrant expression (or loss thereof) of DEF, 
DEF-interacting molecules (particularly src^SH3 domain-containing proteins), or 
signal transducers thereof. 

In addition, drug discovery assays are provided for identifying agents 

1 0 which can modulate the biological function of DEF polypeptides, such as by 

altering the binding of DEF molecules to DEF interacting molecules (particularly 
src SH3 domain-containing proteins) or other intracellular targets (for example, 
cytoskeletal proteins). Such agents can be useful therapeutically to alter diseases 
dependent on cellular gene expression, cytoskeletal architecture, protein 

1 5 trafficking and endocytosis, cell adhesion, migration, proliferation and 
differentiation. 

Various aspects of the invention are described in further detail in the 
following subsections: 

20 I. Nucleic Acids 

As described below, one aspect of the invention pertains to isolated 
nucleic acids comprising nucleotide sequences encoding DEF polypeptides, 
and/or equivalents of such nucleic acids. 

The term "nucleic acid" refers to polynucleotides such as 

25 deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). 
The term should also be understood to include, as equivalents* analogs of either 
RNA or DNA made from nucleotide analogs, and, as applicable to the 
embodiment being described, single (sense or antisense) and double-stranded 
polynucleotides. The term "isolated" as used herein with respect to nucleic 

30 acids, such as DNA or RNA, refers to molecules separated from other DNAs, or 
RNAs, respectively, that are present in the natural source of the macromolecule. 
For example, an isolated nucleic acid encoding one of the subject mammalian 
DEF polypeptides preferably includes no more than 10 kilobases (kb) of nucleic 
acid sequence which naturally immediately flanks the mammalian DEF gene in 

35 genomic DNA, more preferably no more than 5kb of such naturally occurring 
flanking sequences, and most preferably less than 1 ,5kb of such naturally 
occurring flanking sequence. The term isolated as used herein also refers to a 
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nucleic acid or peptide that is substantially free of cellular material, viral 
material, or culture medium when produced by recombinant DNA techniques, or 
chemical precursors or other chemicals when chemically synthesized. Moreover, 
an "isolated nucleic acid" is meant to include nucleic acid fragments which are 
5 not naturally occurring as fragments and would not be found in the natural state. 
The term "isolated" is also used herein to refer to polypeptides which are isolated 
from other cellular proteins and is meant to encompass both purified and 
recombinant polypeptides. 

The term "equivalent" is understood to include nucleotide sequences 

10 encoding functionally equivalent DEF polypeptides or functionally equivalent 
polypeptides having a DEF bioactivity refer to molecules such as proteins and 
peptides which are capable of mimicking or antagonizing all or a portion of the 
biological/biochemical activities of a DEF protein. In addition a polypeptide 
has bioactivity if it is a specific agonist or antagonist (competitor) of a naturally- 

1 5 occurring form of a mammalian DEF protein. In one embodiment a DEF protein 
of the present invention has a DEF bioactivity if it is capable of binding to a src 
SH3 domain, a polypeptide capable of anchoring cytoskeletal elements to the 
plasma membrane, a polypeptide capable of modulating gene expression or G 
protein activity, e.g., GTPase activity, a polypeptide capable of inducing PPARy 

20 mRNA and protein expression. Equivalent nucleotide sequences will include 
sequences that differ by one or more nucleotide substitutions, additions or 
deletions, such as allelic variants; and will, therefore, include sequences that 
differ from the nucleotide sequence of the DEF gene shown in Figure 2 (SEQ ID 
NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 

25 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1) due to the 
degeneracy of the genetic code. 

Other equivalents of DEF include structural equivalents. Structural 
equivalents preferably comprise a motif, e.g., a src SH3 consensus binding 
sequence, a zinc finger domain, a proline-rich repeat, an SH3 domain, and an 

30 ankyrin repeat A portion of DEF polypeptide is at least about 5, 10, 1 5, 20, 25, 
30, 40, 50, 60, 70, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 
700, 750, 800, 850-1 125 amino acid residues in length, preferably at least about 
100-300 amino acid residues in length, more preferably at least about 140-260 
amino acid residues in length, and most preferably at least about 200 amino acid 

35 residues in length corresponding to a protein having at least 80% the amino acid 
sequence shown in Figure 3, (SEQ ID NO: 2), Figure 12 (SEQ ID NO: 4, SEQ 
ID NO: 7, or SEQ ID NO: 10). Preferred nucleotides of the present invention 
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include nucleic acid molecules comprising a nucleotide sequence provided in 
Figure 2 (SEQ. ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 
14 (SEQ ID NO: 6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID 
NO: 1 1), fragments thereof or equivalents thereof. Most preferred portions of 
5 the nucleic acids and DEF polypeptides include at least one, more preferable two 
motifs. For example, a preferred portion of a DEF polypeptide include at least 
one proline-rich motiff and at least one SH3 domain. 

One embodiment the present invention features an isolated DEF nucleic 
acid molecule. In a preferred embodiment the DEF nucleic acid molecule of the 

1 0 present invention is isolated from a vertebrate organism. More preferred DEF 
nucleic acids are mammalian. Particularly preferred DEF nucleic acids are 
human or bovine. 

A particularly preferred DEF nucleic acid is shown in SEQ ID NO:l, 
SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, SEQ ID NO: 

15 9 or SEQ ID NO: 1 1 . The term DEF nucleic acid is also meant to include 

nucleic acic sequences which are homologous to the sequence shown in SEQ ID 
NO:l, SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, SEQ 
ID NO: 9 or SEQ ED NO: 1 lor a sequence which is complementary to that 
shown in SEQ ID NO:l, SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or 

20 SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 1 1 . 

"Complementary" sequences as used herein refer to sequences which 
have sufficient complementarity to be able to hybridize, forming a stable duplex. 

As used herein, the term "specifically hybridizes" refers to the ability of a 
nucleic acid probe/primer of the invention to hybridize to at least 1 5 consecutive 

25 nucleotides of a DEF gene, such as a DEF sequence designated in SEQ ID NO: 
1, SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, SEQ ID 
NO: 9 or SEQ ID NO: 1 1, or a sequence complementary thereto, or naturally 
occurring mutants thereof, such that it has less than 1 5%, preferably less than 
10%, and more preferably less than 5% background hybridization to a cellular 

30 nucleic acid (e.g., mRNA or genomic DNA) encoding a protein other than a DEF 
protein, as defined herein. 

"Homology" or "identity" or "similarity" refers to sequence similarity 
between two peptides or between two nucleic acid molecules. Homology can be 
determined by comparing a position in each sequence which may be aligned for 

35 purposes of comparison. When a position in the compared sequence is occupied 
by the same base or amino acid, then the molecules are homologous at that 
position. A degree of homology between sequences is a function of the number 
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of matching or homologous positions shared by the sequences. An "unrelated" 
or "non-homologous" sequence shares less than 40 % identity, though preferably 
less than 25 % identity, with one of the mammalian DEF sequences of the 
present invention. 

5 To determine the percent homology of two amino acid sequences or of 

two nucleic acids, the sequences are aligned for optimal comparison purposes 
(e.g., gaps can be introduced in the sequence of a first amino acid or nucleic acid 
sequence for optimal alignment with a second amino or nucleic acid sequence 
and non-homologous sequences can be disregarded for comparison purposes). In 

10 a preferred embodiment, the length of a reference sequence aligned for 

comparison purposes is at least 30%, preferably at least 40%, more preferably at 
least 50%, even more preferably at least 60%, and even more preferably at least 
70%, 80%, or 90% of the length of the reference sequence. The amino acid 
residues or nucleotides at corresponding amino acid positions or nucleotide 

1 5 positions are then compared. When a position in the first sequence is occupied 
by the same amino acid residue or nucleotide as the corresponding position in the 
second sequence, then the molecules are homologous at that position (i.e., as 
used herein amino acid or nucleic acid "homology" is equivalent to amino acid or 
nucleic acid "identity"). The percent homology between the two sequences is a 

20 function of the number of identical positions shared by the sequences (i.e., % 
homology = # of identical positions/total # of positions x 100). 

The comparison of sequences and determination of percent homology 
between two sequences can be accomplished using a mathematical algorithim. 
Preferably, the alignment can be performed using the Clustal Method. Multiple 

25 alignment paramethers include GAP Penalty =10, Gap Length Penalty = 1 0. For 
DNA alignments, the pairwise alignment paramenters can be Htuple=2, Gap 
penalty=5, Window=4, and Diagonal saved=4. For protein alignments, the 
pairwise alignment parameters can be Ktuple=l, Gap penalty=3, Window=5, and 
Diagonals Saved-5. 

30 Additional non-limiting example of a mathematical algorithim utilized 

for the comparison of sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Natl. Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul 
(1 993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such an algorithm is 
incorporated into the NBLAST and XBLAST programs (version 2.0) of 

35 Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can 
be performed performed with the NBLAST program, score = 1 00, wordlength = 
12 to obtain nucleotide sequences homologous to nucleic acid molecules of the 
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invention. BLAST protein searches can be performed with the XBLAST 
program, score = 50, wordlength = 3 to obtain amino acid sequences homologous 
to protein molecules of the invention. To obtain gapped alignments for 
comparison purposes, Gapped BLAST can be utilized as described in Altschul et 
5 al., (1 997) Nucleic Acids Research 25(1 7):3389-3402. When utilizing BLAST 
and Gapped BLAST programs, the default parameters of the respective programs 
(e.g., XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. 
Another preferred, non-limiting example of a mathematical algorithim utilized 
for the comparison of sequences is the algorithm of Myers and Miller, CABIOS 

10 (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) 
which is part of the GCG sequence alignment software package. When utilizing 
the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 1 2, and a gap penalty of 4 can be used. 

The term "ortholog" refers to genes or proteins which are homologs via 

1 5 speciation, e.g., closely related and assumed to have common descent based on 
structural and functional considerations. Orthologous proteins function as 
recognizably the same activity in different species. The term "paralog" refers to 
genes or proteins which are homologs via gene duplication, e.g., duplicated 
variants of a gene within a genome. See also, Fritch, WM (1970) Syst Zool 

20 19:99-113. 

Thus, nucleic acids having a sequence that differs from the nucleotide 
sequences shown in SEQ ID No: 1 due to degeneracy in the genetic code are also 
within the scope of the invention. Such nucleic acids encode functionally 
equivalent peptides (i.e., a peptide having a biological activity of a mammalian 

25 DEF polypeptide) but differ in sequence from the sequence shown in the 

sequence listing due to degeneracy in the genetic code. For example, a number 
of amino acids are designated by more than one triplet. Codons that specify the 
same amino acid, or synonyms (for example, CAU and CAC each encode 
histidine) may result in "silent" mutations which do not affect the amino acid 

30 sequence of a mammalian DEF polypeptide. However, it is expected that DNA 
sequence polymorphisms that do lead to changes in the amino acid sequences of 
the subject DEF polypeptides will exist among mammalians. One skilled in the 
art will appreciate that these variations in one or more nucleotides (e.g., up to 
about 3-5% of the nucleotides) of the nucleic acids encoding polypeptides having 

35 an activity of a mammalian DEF polypeptide may exist among individuals of a 
given species due to natural allelic variation. 
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In a preferred embodiment a DEF nucleic acid is at least about 85% 
homologous to the nucleic acid sequence shown in Figure 2 (SEQ. ID NO:l), 
Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ 
ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1) or its complement. 
5 In more preferred embodiments a DEF nucleic acid is at least about 90-99% 
homologous to the nucleic acid sequence shown in Figure 2 (SEQ. ID NO:l), 
Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ 
ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1). In particularly 
preferred embodiments a DEF nucleic acid sequence is identical to the nucleotide 

10 sequence of Figure 2 (SEQ. ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 
5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or 
SEQ ID NO: 11). 

In another embodiment a DEF nucleic acid includes a nucleic acid 
sequence at least 70% homologous to the nucleotide sequence of Figure 2 (SEQ. 

15 ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 
6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1). In a 
preferred embodiment a DEF nucleic acid contains a sequence at least about 85% 
homologous to the nucleotide sequence of Figure 2 (SEQ. ID NO: 1 ), Figure 1 3 
(SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), 

20 or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1). In a more preferred 

embodiment a DEF nucleic acid of the present invention contains a nucleotide 
sequence at least about 90-99% homologous to the nucleotide sequence of Figure 
2 (SEQ. ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ 
ID NO: 6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 11). 

25 In a particularly preferred embodiment a DEF nucleic acid contains a sequence 
identical to the nucleotide sequence of Figure 2 (SEQ. ID NO: 1), Figure 13 
(SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), 
or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1). 

In another embodiment a DEF nucleic acid includes a nucleic acid 

30 sequence at least 80% homologous to the nucleotide sequence of Figure 2 (SEQ. 
ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 
6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1). In a 
preferred embodiment a DEF nucleic acid contains a sequence at least about 85% 
homologous to the nucleotide sequence of Figure 2 (SEQ. ID NO:l), Figure 13 

35 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), 
or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 11). In a more preferred 
embodiment a DEF nucleic acid of the present invention contains a nucleotide 
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sequence at least about 90% homologous to the nucleotide sequence of Figure 2 
(SEQ. ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ 
ID NO: 6 or SEQ ID NO: 8), or Figure 1 5 (SEQ ID NO: 9 or SEQ ID NO: 1 1). 
In a particularly preferred embodiment a DEF nucleic acid contains a sequence 
5 identical to the nucleotide sequence of Figure 2 (SEQ. ID NO: 1 ), Figure 1 3 

(SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), 
or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1). 

In one embodiment a DEF nucleic acid contains a nucleotide sequence at 
least about 70% homologous to the sequence of Figure 2 (SEQ; ID NO:l), 

10 Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ 
ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1) and encodes a 
polypeptide with a DEF bioactivity, e.g., induces adipogenesis or neurogenesis. 
In a preferred embodiment a DEF nucleic acid contains a nucleotide sequence at 
least about 80% homologous to the sequence of Figure 2 (SEQ. ID NO:l), 

15 Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ 
ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1) and encodes a 
polypeptide with a DEF bioactivity. In a more preferred embodiment a DEF 
nucleic acid contains a nucleotide sequence at least about 90-99% homologous to 
the sequence of Figure 2 (SEQ. ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID 

20 NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 
9 or SEQ ID NO: 11) and encodes a polypeptide with a DEF bioactivity. In a 
particularly preferred embodiment a DEF nucleic acid contains a nucleotide 
sequence identical to the sequence of Figure 2 (SEQ. ID NO:l), Figure 2 (SEQ. 
ID NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 

25 6 or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1) and 
encodes a polypeptide with a DEF bioactivity. 

In a preferred embodiment a DEF nucleic acid is at least about 90% 
homologous to the coding sequence shown in Figure 2 (SEQ ID NO:l), Figure 
13 (SEQ ID NO: 3), Figure 14 (SEQ ID NO: 6), or Figure 15 (SEQ ID NO: 9) or 

30 its complement. In more preferred embodiments a DEF nucleic acid is at least 
about 96-97% homologous to the coding sequence shown in Figure 2 (SEQ ID 
NO:l), Figure 13 (SEQ ID NO: 3), Figure 14 (SEQ ID NO: 6), or Figure 15 
(SEQ ID NO: 9). In particularly preferred embodiments a DEF nucleic acid 
sequence is identical to the coding sequence of Figure 2 (SEQ ID NO:l), Figure 

35 13 (SEQ ID NO: 3), Figure 14 (SEQ ID NO: 6), or Figure 15 (SEQ ID NO: 9). 
A DEF nucleic acid molecucle can include an open reading frame 
encoding one of the mammalian DEF polypeptides of the present invention, 
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including both exon and (optionally) intron sequences. A "recombinant gene" 
refers to nucleic acid encoding a mammalian DEF polypeptide and comprising 
mammalian DEF-encoding exon sequences, though it may optionally include 
intron sequences which are either derived from a chromosomal mammalian DEF 
5 gene or from an unrelated chromosomal gene. The term "intron" refers to a 
DNA sequence present in a given mammalian DEF gene which is not translated 
into protein and is generally found between exons. 

In certain embodiments the subject DEF nucleic acid molecules include 
the 5 f and 3' untranslated sequences which flank the gene, i.e., noncoding 

10 sequences, and do not encode for amino acids of a DEF polypeptide. In a 

preferred embodiment a DEF nuceleic acid molecule contains the coding region 
of SEQ ID NO: 1 , SEQ ID NO: 3, SEQ ID NO: 6, or SEQ ID NO: 9. 

"Transcriptional regulatory sequence" is a term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, 

1 5 and promoters, which induce or control transcription of protein coding sequences 
with which they are operatively linked. In preferred embodiments, transcription 
of one of the recombinant mammalian DEF genes is under the control of a 
promoter sequence (or other transcriptional regulatory sequence) which controls 
the expression of the recombinant gene in a cell-type in which expression is 

20 intended. It will also be understood that the recombinant gene can be under the 
control of transcriptional regulatory sequences which are the same or which are 
different from those sequences which control transcription of the naturally- 
occurring forms of DEF proteins. 

Another aspect of the invention provides a nucleic acid which hybridizes 

25 under stringent conditions to a nucleic acid represented by Figure 2 (SEQ. ID 
NO:l), Figure 13 (SEQ ID NO: 3 or SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 
or SEQ ID NO: 8), or Figure 15 (SEQ ID NO: 9 or SEQ ID NO: 1 1) or its 
complement Appropriate stringency conditions which promote DNA 
hybridization, for example, 6.0 x sodium chloride/sodium citrate (SSC) at about 

30 45°C, followed by a wash of 2.0 x SSC at 50°C, are known to those skilled in the 
art or can be found in Current Protocols in Molecular Biology, John Wiley & 
Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in the wash 
step can be selected from a low stringency of about 2.0 x SSC at 50°C to a high 
stringency of about 0.2 x SSC at 50°C. In addition, the temperature in the wash 

35 step can be increased from low stringency conditions at room temperature, about 
22°C, to high stringency conditions at about 65°C. Both temperature and salt 
may be varied, or temperature or salt concentration may be held constant while 
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the other variable is changed. In a particularly preferred embodiment, a DEF 
nucleic acid of the present invention will bind to SEQ. ID NO: 1 , SEQ ID NO: 3 
or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or SEQ ID NO: 9 or SEQ 
ID NO: 1 1 under stringent conditions. 
5 As used herein, the term "specifically hybridizes" or "specifically detects" 

refers to the ability of a nucleic acid molecule of the invention to hybridize to at 
least approximately 6, 12, 20, 30, 50, 100, 150, 200, or 300 consecutive 
nucleotides of a vertebrate, preferably mammalian, DEF gene, such as a DEF 
sequence designated in SEQ. ID NO:l, SEQ ID NO: 3 or SEQ ID NO: 5, SEQ 

1 0 ID NO: 6 or SEQ ID NO: 8, or SEQ ID NO: 9 or SEQ ID NO: 1 1 , or a sequence 
complementary thereto, or naturally occurring mutants thereof, such that it shows 
more than 10 times more hybridization, preferably more than 100 times more 
hybridization, and even more preferably more than 100 times more hybridization 
than it does to to a cellular nucleic acid (e.g., mRNA or genomic DNA) encoding 

1 5 a protein other than a vertebrate, preferably mammalian, DEF protein as defined 
herein. In a particularly preferred embodiment a DEF nucleic acid fragment 
specifically detects a DEF, and not dynamin or dynamin-related sequences. 

In a further embodiment a DEF nucleic acid sequence encodes a 
vertebrate DEF polypeptide. Preferred nucleic acids of the present invention 

20 encode a DEF polypeptide which includes a polypeptide sequence corresponding 
to all or a portion of amino acid residues of SEQ ID NO:2, SEQ ID NO: 4, SEQ 
ID NO: 7, or SEQ ID NO: 10, e.g., at least 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 
100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850- 
1 125 amino acid residues of that region. Genes for a particular polypeptide may 

25 exist in single or multiple copies within the genome of an individual. Such 
duplicate genes may be identical or may have certain modifications, including 
nucleotide substitutions, additions or deletions, which all still code for 
polypeptides having substantially the same activity. The term "nucleic acid 
sequence encoding a vertebrate DEF polypeptide" may thus refer to one or more 

30 genes within a particular individual. Moreover, certain differences in nucleotide 
sequences may exist between individual organisms, which are called alleles. 
Such allelic differences may or may not result in differences in amino acid 
sequence of the encoded polypeptide yet still encode a protein with the same 
bioactivity. 

35 In one embodiment a DEF nucleic acid encodes a polypeptide sequence 

at least 85% homologous to the sequence shown in SEQ ID NO: 2, SEQ ID NO: 
4, SEQ ID NO: 7 or SEQ ID NO: 10. In a preferred embodiment a DEF nucleic 
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acid encodes a sequence at least 91-99% homologous to the sequence shown in 
SEQ ID NO: 2, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 
10. In a more preferred embodiment a DEF nucleic acid encodes a sequence at 
least about 95 % homologous to the sequence shown in SEQ ID NO: 2, SEQ ID 
5 NO: 2, SEQ ID NO: 4, SEQ ID NO: 7 or SEQ ID NO: 10. In a particularly 
preferred embodiment the subject DEF nucleic acid molecule encodes the 
polypeptide shown in SEQ ID NO. 2, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 7 or SEQ ID NO: 10. 

In another embodiment a DEF nucleic acid molecule encodes a 

10 polypeptide with a DEF bioactivity and contains a src consensus binding 

sequence, at least one ankyrin repeat, a zinc finger domain, a proline-rich repeat, 
a C2 domain and a PH domain. 

The subject DEF nucleic acid sequences allow for the generation of 
nucleic acid fragments (e.g., probes and primers) designed for use in identifying 

1 5 and/or cloning DEF homologs in other cell types, e.g. from other tissues, as well 
as DEF homologs from other mammalian organisms. For instance, the present 
invention also provides a nucleic acid fragment that can be used as a primer. The 
fragment can comprise a substantially purified oligonucleotide, containing a 
region of nucleotide sequence that hybridizes under stringent conditions to at 

20 least approximately 12, preferably 25, more preferably 40, 50 or 75 consecutive 
nucleotides of sense or anti-sense sequence of SEQ ID NO:l, SEQ. ID NO:l, 
SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or SEQ ID 
NO: 9 or SEQ ID NO: 1 1, or naturally occurring mutants thereof. For instance, 
primers based on the nucleic acid represented in SEQ ID NO: 1 , SEQ. ID NO: 1 , 

25 SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or SEQ ID 
NO: 9 or SEQ ID NO: 1 1 can be used in PCR reactions to clone DEF homologs. 

In another embodiment, a DEF nucleic acid fragment is an 
oligonucleotide probe which specifically detects a DEF nucleic and relative to a 
dynamin or dynamin-related nucleic acid sequences. In a preferred embodiment 

30 the subject oligonucleotide hybridizes under stringent conditions to at least 6 

consecutive nucleotides encoding the DEF nucleic acid (SEQ ID NO: 1, SEQ. ID 
NO:l, SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or 
SEQ ID NO: 9 or SEQ ID NO: 1 1). 

In preferred embodiments, the probe further contains a label group 

35 capable of detection, e.g. the label group can be a radioisotope, fluorescent 
compound, enzyme, or enzyme co-factor. Probes based on the subject DEF 
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sequences can also be used to detect transcripts or genomic sequences encoding 
the same or homologous proteins. 

As discussed in more detail below, the probes of the present invention 
can also be used as a part of a diagnostic test kit for identifying cells or tissue 
5 which misexpress a DEF protein, such as by measuring a level of a DEF- 
encoding nucleic acid in a sample of cells from a patient; e.g. detecting DEF 
mRNA levels or determining whether a genomic DEF gene has been mutated or 
deleted. Briefly, nucleotide probes can be generated from the subject DEF genes 
which facilitate histological screening of intact tissue and tissue samples for the 

1 0 presence (or absence) of DEF-encoding transcripts. Similar to the diagnostic 
uses of anti-DEF antibodies, the use of probes directed to DEF messages, or to 
genomic DEF sequences, can be used for both predictive and therapeutic 
evaluation of allelic mutations which might be manifest in certain disorders. 
Used in conjunction with immunoassays as described herein, the oligonucleotide 

1 5 probes can help facilitate the determination of the molecular basis for a disorder 
which may involve some abnormality associated with expression (or lack 
thereof) of a DEF protein. For instance, variation in polypeptide synthesis can 
be differentiated from a mutation in a coding sequence. 

Another aspect of the invention relates to the use of isolated DEF nucleic 

20 acids in "antisense" therapy. As used herein, "antisense" therapy refers to 
administration or in situ generation of oligonucleotide molecules or their 
derivatives which specifically hybridize (e.g. bind) under cellular conditions, 
with the cellular mRNA and/or genomic DNA encoding one or more of the 
subject DEF proteins so as to inhibit expression of that protein, e.g. by inhibiting 

25 transcription and/or translation. The binding may be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, 
through specific interactions in the major groove of the double helix. In general, 
"antisense" therapy refers to the range of techniques generally employed in the 
art, and includes any therapy which relies on specific binding to oligonucleotide 

30 sequences. 

An antisense construct of the present invention can be delivered, for 
example, as an expression plasmid which, when transcribed in the cell, produces 
RNA which is complementary to at least a unique portion of the cellular mRNA 
which encodes a mammalian DEF protein. Alternatively, the antisense construct 
35 is an oligonucleotide probe which is generated ex vivo and which, when 

introduced into the cell causes inhibition of expression by hybridizing with the 
mRNA and/or genomic sequences of a mammalian DEF gene. Such 
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oligonucleotide probes are preferably modified oligonucleotides which are 
resistant to endogenous nucleases, e.g. exonucleases and/or endonucleases, and 
are therefore stable in vivo. Exemplary nucleic acid molecules for use as 
antisense oligonucleotides are phosphoramidate, phosphothioate and 
5 methylphosphonate analogs of DNA (see also U.S. Patents 5,176,996; 5,264,564; 
and 5,256,775). Additionally, general approaches to constructing oligomers 
useful in antisense therapy have been reviewed, for example, by Van der Krol et 
al. (1988) Biotechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 
2668. 

10 Antisense approaches involve the design of oligonucleotides (either DNA 

or RNA) that are complementary to DEF mRNA. The antisense oligonucleotides 
will bind to the DEF mRNA transcripts and prevent translation. Absolute 
complementarity, although preferred, is not required. A sequence 
"complementary" to a portion of an RNA, as referred to herein, means a 

1 5 sequence having sufficient complementarity to be able to hybridize with the 

RNA, forming a stable duplex. In the case of double-stranded antisense nucleic 
acids, a single strand of the duplex DNA may thus be tested, or triplex formation 
may be assayed. The ability to hybridize will depend on both the degree of 
complementarity and the length of the antisense nucleic acid. Generally, the 

20 longer the hybridizing nucleic acid, the more base mismatches with an RNA it 
may contain and still form a stable duplex (or triplex, as the case may be). One 
skilled in the art can ascertain a tolerable degree of mismatch by use of standard 
procedures to determine the melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the message, 

25 e.g., the 5' untranslated sequence up to and including the AUG initiation codon, 
should work most efficiently at inhibiting translation. However, sequences 
complementary to the 3* untranslated sequences of mRNAs have recently been 
shown to be effective at inhibiting translation of mRNAs as well. (Wagner, R 
1994. Nature 372:333). Therefore, oligonucleotides complementary to either the 

30 5 f or 3' untranslated, non-coding regions of a DEF gene could be used in an 
antisense approach to inhibit translation of endogenous DEF mRNA. 
Oligonucleotides complementary to the 5' untranslated region of the mRNA 
should include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mRNA coding regions are less efficient 

35 inhibitors of translation but could be used in accordance with the invention. 
Whether designed to hybridize to the 5', 3' or coding region of DEF mRNA, 
antisense nucleic acids should be at least six nucleotides in length, and are 
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preferably oligonucleotides ranging from 6 to about 50 nucleotides in length. In 
certain embodiments, the oligonucleotide is at least 10 nucleotides, at least 17 
nucleotides, at least 25 nucleotides, or at least 50 nucleotides. 

Regardless of the choice of target sequence, it is preferred that in vitro 
5 studies are first performed to quantitate the ability of the antisense 

oligonucleotide to quantitate the ability of the antisense oligonucleotide to inhibit 
gene expression. It is preferred that these studies utilize controls that distinguish 
between antisense gene inhibition and nonspecific biological effects of 
oligonucleotides. It is also preferred that these studies compare levels of the 

1 0 target RNA or protein with that of an internal control RN A or protein. 
Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control 
oligonucleotide. It is preferred that the control oligonucleotide is of 
approximately the same length as the test oligonucleotide and that the nucleotide 

15 sequence of the oligonucleotide differs from the antisense sequence no more than 
is necessary to prevent specific hybridization to the target sequence. 

The oligonucleotides can be DNA or RNA or chimeric mixtures or 
derivatives or modified versions thereof, single-stranded or double-stranded. 
The oligonucleotide can be modified at the base moiety, sugar moiety, or 

20 phosphate backbone, for example, to improve stability of the molecule, 

hybridization, etc. The oligonucleotide may include other appended groups such 
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. 
Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 

25 84:648-652; PCT Publication No. W088/09810, published December 15, 1988) 
or the blood-brain barrier (see, e.g., PCT Publication No. W089/10134, 
published April 25, 1988), hybridization-triggered cleavage agents. (See, e.g., 
Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., 
Zon, 1988, Pharm. Res. 5:539-549). To this end, the oligonucleotide maybe 

30 conjugated to another molecule, e.g., a peptide, hybridization triggered cross- 
linking agent, transport agent, hybridization-triggered cleavage agent, etc. 

While antisense nucleotides complementary to the DEF coding region 
sequence could be used, those complementary to the transcribed untranslated 
region are most preferred. 

35 The antisense molecules can be delivered to cells which express the DEF 

in vivo or in vitro. A number of methods have been developed for delivering 
antisense DNA or RNA to cells; e.g., antisense molecules can be injected directly 
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into the tissue site, or modified antisense molecules, designed to target the 
desired cells (e.g., antisense linked to peptides or antibodies that specifically bind 
receptors or antigens expressed on the target cell surface) can be administered 
systematically. 

5 Since, it is often difficult to achieve intracellular concentrations of the 

antisense sufficient to suppress translation on endogenous mRNAs, a preferred 
approach utilizes a recombinant DNA construct in which the antisense 
oligonucleotide is placed under the control of a strong pol HI or pol II promoter. 
The use of such a construct to transfect target cells in the patient will result in the 

1 0 transcription of sufficient amounts of single stranded RN As that will form 
complementary base pairs with the endogenous DEF transcripts and thereby 
prevent translation of the DEF mRNA. For example, a vector can be introduced 
in vivo such that it is taken up by a cell and directs the transcription of an 
antisense RNA. Such a vector can remain episomal or become chromosomally 

1 5 integrated, as long as it can be transcribed to produce the desired antisense RNA. 
Such vectors can be constructed by recombinant DNA technology methods 
standard in the art. Vectors can be plasmid, viral, or others known in the art, 
used for replication and expression in mammalian cells. Expression of the 
sequence encoding the antisense RNA can be by any promoter known in the art 

20 to act in mammalian, preferably human cells. Such promoters can be inducible 
or constitutive. Such promoters include but are not limited to: the SV40 early 
promoter region (Bernoist and Chambon, 1981, Nature 290:304-3 1 0), the 
promoter contained in the 3 1 long terminal repeat of Rous sarcoma virus 
(Yamamotp et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter 

25 (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the 

regulatory sequences of the metallothionein gene (Brinster et al, 1982, Nature 
296:39-42), etc. Any type of plasmid, cosmid, YAC or viral vector can be used 
to prepare the recombinant DNA construct which can be introduced directly into 
the tissue site; e.g., the choroid plexus or hypothalamus. Alternatively, viral 

30 vectors can be used which selectively infect the desired tissue; (e.g., for brain, 
herpesvirus vectors may be used), in which case administration may be 
accomplished by another route (e.g., systematically). 

Ribozyme molecules designed to catalytically cleave DEF mRNA 
transcripts can also be used to prevent translation of DEF mRNA and expression 

35 of DEF. (See, e.g., PCT International Publication WO90/1 1364, published 

October 4, 1990; Sarver et al., 1990, Science 247:1222-1225). While ribozymes 
that cleave mRNA at site specific recognition sequences can be used to destroy 
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DEF mRNAs, the use of hammerhead ribozymes is preferred. Hammerhead 
ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that 
the target mRNA have the following sequence of two bases: 5-UG-3'. The 
5 construction and production of hammerhead ribozymes is well known in the art 
and is described more fully in Haseloff and Gerlach, 1988, Nature, 334:585-591. 
There are numerous potential hammerhead ribozyme cleavage sites within the 
nucleotide sequence of human DEF cDNA. Preferably the ribozyme is 
engineered so that the cleavage recognition site is located near the 5' end of the 

1 0 DEF mRNA; i.e., to increase efficiency and minimize the intracellular 
accumulation of non-functional mRNA transcripts. 

Ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one which occurs naturally in 
Tetrahymena Thermophila (known as the IVS, or L-19 IVS RNA) and which has 

1 5 been extensively described by Thomas Cech and collaborators (Zaug, et al., 

1984, Science, 224:574-578; Zaug and Cech, 1986, Science, 231:470-475; Zaug, 
et al., 1986, Nature, 324:429-433; published International patent application No. 
WO88/04300 by University Patents Inc.; Been and Cech, 1986, Cell, 47:207- 
216). The Cech-type ribozymes have an eight base pair active site which 

20 hybridizes to a target RNA sequence whereafter cleavage of the target RNA 

takes place. The invention encompasses those Cech-type ribozymes which target 
eight base-pair active site sequences that are present in DEF. 

As in the antisense approach, the ribozymes can be composed of 
modified oligonucleotides (e.g. for improved stability, targeting, etc.) and should 

25 be delivered to cells which express the DEF in vivo e.g., T cells. A preferred 
method of delivery involves using a DNA construct "encoding" the ribozyme 
under the control of a strong constitutive pol III or pol II promoter, so that 
transfected cells will produce sufficient quantities of the ribozyme to destroy 
endogenous DEF and inhibit translation. Because ribozymes unlike antisense 

30 molecules, are catalytic, a lower intracellular concentration is required for 
efficiency. 

Endogenous DEF gene expression can also be reduced by inactivating or 
"knocking out" the DEF gene or its promoter using targeted homologous 
recombination. (E.g., see Smithies et al., 1985, Nature 317:230-234; Thomas & 
35 Capecchi, 1987, Cell 51:503-512; Thompson et al., 1989 Cell 5:313-321; each of 
which is incorporated by reference herein in its entirety). For example, a mutant, 
non-functional DEF (or a completely unrelated DNA sequence) flanked by DNA 
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homologous to the endogenous DEF gene (either the coding regions or 
regulatory regions of the DEF gene) can be used, with or without a selectable 
marker and/or a negative selectable marker, to transfect cells that express DEF in 
vivo. Insertion of the DNA construct, via targeted homologous recombination, 
5 results in inactivation of the DEF gene. Such approaches are particularly suited 
in the generation of animal offspring with an inactive DEF (e.g., see Thomas & 
Capecchi 1987 and Thompson 1989, supra). However this approach can be 
adapted for use in humans provided appropriate delivery means are used. 
Alternatively, endogenous DEF gene expression can be reduced by 

10 targeting deoxyribonucleotide sequences complementary to the regulatory region 
of the DEF gene (i.e., the DEF promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the DEF gene in target cells in the body. 
(See generally, Helene, C. 1991, Anticancer Drug Des., 6(6):569-84; Helene, C, 
et al., 1992, Ann, N.Y. Accad. Sci., 660:27-36; and Maher, L J., 1992, Bioassays 

15 14(12):807-15). 

Nucleic acid molecules to be used in triple helix formation for the 
inhibition of transcription are preferably single stranded and composed of 
deoxyribonucleotides. The base composition of these oligonucleotides should 
promote triple helix formation via Hoogsteen base pairing rules, which generally 

20 require sizable stretches of either purines or pyrimidines to be present on one 
strand of a duplex. Nucleotide sequences may be pyrimidine-based, which will 
result in TAT and CGC triplets across the three associated strands of the 
resulting triple helix. The pyrimidine-rich molecules provide base 
complementarity to a purine-rich region of a single strand of the duplex in a 

25 parallel orientation to that strand. In addition, nucleic acid molecules may be 

chosen that are purine-rich. These molecules will form a triple helix with a DNA 
duplex that is rich in GC pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting in CGC triplets across 
the three strands in the triplex. 

30 Alternatively, the potential sequences that can be targeted for triple helix 

formation may be increased by creating a so called "switchback" nucleic acid 
molecule. Switchback molecules are synthesized in an alternating 5'-3', 3'-5' 
manner, such that they base pair with first one strand of a duplex and then the 
other, eliminating the necessity for a sizable stretch of either purines or 

35 pyrimidines to be present on one strand of a duplex. 

In yet another embodiment, the antisense oligonucleotide is an a- 
anomeric oligonucleotide. An a-anomeric oligonucleotide forms specific 
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double-stranded hybrids with complementary RNA in which, contrary to the 
usual P-units, the strands run parallel to each other (Gautier et al., 1987, Nucl. 
Acids Res. 15:6625-6641). The oligonucleotide is a 2 , -0-methylribonucleotide 
(Inoueetal., 1987, NucL Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
5 analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

DEF nucleic acids can be obtained from mRNA present in any of a 
number of eukaryotic cells. It should also be possible to obtain nucleic acids 
encoding mammalian DEF polypeptides of the present invention from genomic 
DNA from both adults and embryos. For example, a gene encoding a DEF 

10 protein can be cloned from either a cDNA or a genomic library in accordance 
with protocols described herein, as well as those generally known to persons 
skilled in the art Examples of tissues and/or libraries suitable for isolation of the 
subject nucleic acids include T cells, among others. A cDNA encoding a DEF 
protein can be obtained by isolating total mRNA from a cell, e.g. a vertebrate 

1 5 cell, a mammalian cell, or a human cell, including embryonic cells. Double 
stranded cDNAs can then be prepared from the total mRNA, and subsequently 
inserted into a suitable plasmid or bacteriophage vector using any one of a 
number of known techniques. The gene encoding a mammalian DEF protein can 
also be cloned using established polymerase chain reaction techniques in 

20 accordance with the nucleotide sequence information provided by the invention. 
The nucleic acid of the invention can be DNA or RNA. A preferred nucleic acid 
is a cDNA represented by a sequence shown in SEQ ID NO: 1 , SEQ. ID NO: 1 , 
SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID NO: 8, or SEQ ID 
NO: 9 or SEQ ID NO: 11. 

25 Alternatively, RNA molecules may be generated by in vitro and in vivo 

transcription of DNA sequences encoding the antisense RNA molecule. Such 
DNA sequences may be incorporated into a wide variety of vectors which 
incorporate suitable RNA polymerase promoters such as the T7 or SP6 
polymerase promoters. Alternatively, antisense cDNA constructs that synthesize 

30 antisense RNA constitutively or inducibly, depending on the promoter used, can 
be introduced stably into cell lines. 

Any of the subject nucleic acids can also be obtained by chemical 
synthesis. For example, nucleic acids of the invention may be synthesized by 
standard methods known in the art, e.g. by use of an automated DNA synthesizer 

35 (such as are commercially available from Biosearch, Applied Biosystems, etc.). 
As examples, phosphorothioate oligonucleotides may be synthesized by the 
method of Stein et al. (1 988) NucL Acids Res. 16:3209, methylphosphonate 
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olgonucleotides can be prepared by use of controlled pore glass polymer supports 
(Sarin et al., 1988, Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. Other 
techniques for chemically synthesizing oligodeoxyribonucleotides and 
oligoribonucleotides well known in the art such as for example solid phase 
5 phosphoramidite chemical synthesis. 

Moreover, various well-known modifications to nucleic acid molecules 
may be introduced as a means of increasing intracellular stability and half-life. 
Possible modifications include but are not limited to the addition of flanking 
sequences of ribonucleotides or deoxyribonucleotides to the 5* and/or 3' ends of 

10 the molecule or the use of phosphorothioate or 2* O-methyl rather than 

phosphodiesterase linkages within the oligodeoxyribonucleotide backbone. 

The subject nucleic acids may also contain modified bases. For example, 
a nucleic acid may comprise at least one modified base moiety which is selected 
from the group including but not limited to 5-fluorouraciI, 5-bromouracil, 5- 

15 chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- 

(carboxyhydroxylmethyl) uracil, 5-<^boxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2- 
dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5- 

20 methylcytosine, N6-adenine,7-methylguanine, 5-methylaminomethyluracil, 5- 
methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5- 
methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxy acetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 

25 methyluracil, uracil-5- oxy acetic acid methylester, uracil-5-oxy acetic acid (v), 5- 
methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 
diaminopurine. 

A modified nucleic acid of the present invention may also include at least 
one modified sugar moiety selected from the group including but not limited to 
30 arabinose, 2-fluoroarabinose, xylulose, and hexose. 

In yet another embodiment, the subject nucleic acid may include at least 
one modified phosphate backbone selected from the group consisting of a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 
35 phosphotriester, and a formacetal or analog thereof. 
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II. Recombinant Expression Vectors and Host Cells 

The present invention also provides for vectors containing the subject 
nucleic acid molecules. As used herein, the term "vector" refers to a nucleic acid 
molecule capable of transporting another nucleic acid to which it has been 
5 linked. One type of preferred vector is an episome, i.e., a nucleic acid capable of 
extra-chromosomal replication. Preferred vectors are those capable of 
autonomous replication and/expression of nucleic acids to which they are linked. 
Vectors capable of directing the expression of genes to which they are 
operatively linked are referred to herein as "expression vectors". In general, 

1 0 expression vectors of utility in recombinant DNA techniques are often in the 

form of "plasmids" which refer generally to circular double stranded DNA loops 
which, in their vector form are not bound to the chromosome. In the present 
specification, "plasmid" and "vector" are used interchangeably as the plasmid is 
the most commonly used form of vector. However, the invention is intended to 

15 include such other forms of expression vectors which serve equivalent functions. 

This invention also provides expression vectors containing a nucleic acid 
encoding a DEF polypeptide, operatively linked to at least one transcriptional 
regulatory sequence. "Operatively linked" is intended to mean that the 
nucleotide sequence is linked to a regulatory sequence in a manner which allows 

20 expression of the nucleotide sequence. Transcriptional regulatory sequences are 
art-recognized and are selected to direct expression of the subject mammalian 
DEF proteins. Exemplary regulatory sequences are described in Goeddel; Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990). 

25 In a preferred embodiment the expression vector of the present invention 

is capable of replicating in a cell. In one embodiment, the expression vector 
includes a recombinant gene encoding a peptide having DEF bioactivity. Such 
expression vectors can be used to transfect cells and thereby produce 
polypeptides, including fusion proteins, encoded by nucleic acids as described 

30 herein. Moreover, the gene constructs of the present invention can also be used 
as a part of a gene therapy protocol to deliver nucleic acids encoding either an 
agonistic or antagonistic form of one of the subject mammalian DEF proteins. 
Thus, another aspect of the invention features expression vectors for in vivo or in 
vitro transfection and expression of a mammalian DEF polypeptide in particular 

35 cell types so as to reconstitute the function of, or alternatively, abrogate the 
function of DEF in a tissue. For example, DEF or fragments thereof may be 
expressed in a cell in order to induce growth arrest and/or terminal differentiation 
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of a proliferating cell, e.g., a cancer cell. As an illustrative embodiment, 
transfected DEF may induce growth arrest of an adipocyte cell or a neuronal cell. 
Alternatively, inhibition of the cell proliferation in a subject can be obtained by 
abrogate the function of DEF in therapeutic intervention in diseases as cancer. In 
5 another embodiment, DEF or fragments thereof may be expressed in a 
mammalian cell, e.g., an adipocyte or a neural cell. 

In addition to viral transfer methods, such as those described above, non- 
viral methods can also be employed to cause expression of a subject DEF 
polypeptide in the tissue of an animal. Most nonviral methods of gene transfer 

1 0 rely on normal mechanisms used by mammalian cells for the uptake and 

intracellular transport of macromolecules. In preferred embodiments, non-viral 
targeting means of the present invention rely on endocytic pathways for the 
uptake of the subject DEF polypeptide gene by the targeted cell. Exemplary 
targeting means of this type include liposomal derived systems, poly-lysine 

1 5 conjugates, and artificial viral envelopes. 

The recombinant DEF genes can be produced by ligating nucleic acid 
encoding a DEF protein, or a portion thereof, into a vector suitable for expression 
in either prokaryotic cells, eukaryotic cells, or both. Expression vectors for 
production of recombinant forms of the subject DEF polypeptides include 

20 piasmids and other vectors. For instance, suitable vectors for the expression of a 
DEF polypeptide include piasmids of the types: pBR322-derived piasmids, 
pEMBL-derived piasmids, pEX-derived piasmids, pBTac-derived piasmids and 
pUC-derived piasmids for expression in prokaryotic cells, such as E. coli. 

A number of vectors exist for the expression of recombinant proteins in 

25 yeast For instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are 
cloning and expression vehicles useful in the introduction of genetic constructs 
into S. cerevisiae (see, for example, Broach et al. (1983) in Experimental 
Manipulation of Gene Expression, ed. M. Inouye Academic Press, p. 83, 
incorporated by reference herein). These vectors can replicate in E. coli due the 

30 presence of the pBR322 ori, and in S. cerevisiae due to the replication 

determinant of the yeast 2 micron plasmid. In addition, drug resistance markers 
such as ampicillin can be used. In an illustrative embodiment, a DEF 
polypeptide is produced recombinantly utilizing an expression vector generated 
by sub-cloning the coding sequence of one of the DEF genes represented in SEQ 

35 ID NO:l, SEQ. ID NO:l, SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or 
SEQ ID NO: 8, or SEQ ID NO: 9 or SEQ ID NO: 1 1 . 
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The preferred mammalian expression vectors contain both prokaryotic 
sequences, to facilitate the propagation of the vector in bacteria, and one or more 
eukaryotic transcription units that are expressed in eukaryotic cells. The 
pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, 
5 pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples of 
mammalian expression vectors suitable for transfection of eukaryotic cells. 
Some of these vectors are modified with sequences from bacterial plasmids, such 
as pBR322, to facilitate replication and drug resistance selection in both 
prokaryotic and eukaryotic cells. Alternatively, derivatives of viruses such as the 

1 0 bovine papillomavirus (BPV- 1 ), or Epstein-Barr virus (pHEBo, pREP-derived 
and p205) can be used for transient expression of proteins in eukaryotic cells. 
The various methods employed in the preparation of the plasmids and 
transformation of host organisms are well known in the art. For other suitable 
expression systems for both prokaryotic and eukaryotic cells, as well as general 

15 recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., 
ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 
1989) Chapters 16 and 17. 

In some instances, it may be desirable to express the recombinant DEF 
polypeptide by the use of a baculovirus expression system. Examples of such 

20 baculovirus expression systems include pVL-derived vectors (such as pVLl 392, 
pVL1393 and pVL941), pAcUW-derived vectors (such as pAcUWl), and 
pBlueBac-derived vectors (such as the fl-gal containing pBlueBac III). 

In some cases it will be desirable to express only a portion of a DEF 
protein. The subject vectors can also include fragments of a DEF nucleic acid 

25 encoding a fragment of a DEF protein. In a preferred embodiment, subdomains 
of a DEF protein are expressed. 

The subject vectors can be used to transfect a host cell in order to express 
a recombinant form of the subject DEF polypeptides. The host cell may be any 
prokaryotic or eukaryotic cell. Thus, a nucleotide sequence derived from the 

30 cloning of mammalian DEF proteins, encoding all or a selected portion of the 
full-length protein, can be used to produce a recombinant form of a mammalian 
DEF polypeptide in a cell. 

"Cells," "host cells" or "recombinant host cells" are terms used 
interchangeably herein. It is understood that such terms refer not only to the 

35 particular subject cell but to the progeny or potential progeny of such a cell. 

Because certain modifications may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may not, in fact, be identical 
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to the parent cell, but are still included within the scope of the term as used 
herein. 

The present invention further pertains to methods of producing the 
subject DEF. polypeptides. For example, a host cell transfected with a nucleic 
5 acid vector directing expression of a nucleotide sequence encoding the subject 
polypeptides can be cultured under appropriate conditions to allow expression of 
the peptide to occur. The cells may be harvested, lysed and the protein isolated. 
A cell culture includes host cells, media and other byproducts. Suitable media 
for cell culture are well known in the art. The recombinant DEF polypeptide can 

10 be isolated from cell culture medium, host cells, or both using techniques known 
in the art for purifying proteins including ion-exchange chromatography, gel 
filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity 
purification with antibodies specific for such peptide. In a preferred 
embodiment, the recombinant DEF polypeptide is a fusion protein containing a 

1 5 domain which facilitates its purification, such as GST fusion protein or poly (His) 
fusion protein. 

In other embodiments transgenic animals, described in more detail below 
could be used to produce recombinant proteins. 

The present invention also provides for a recombinant transfection 

20 system, including a DEF gene construct operatively linked to a transcriptional 
regulatory sequence and a gene delivery composition for delivering the gene 
construct to a cell so that the cell expresses the DEF protein. 

As used herein, the term "transfection" means the introduction of a 
nucleic acid, e.g., an expression vector, into a recipient cell by nucleic acid- 

25 mediated gene transfer. "Transformation", as used herein, refers to a process in 
which a cell's genotype is changed as a result of the cellular uptake of exogenous 
DN A or RNA, and, for example, the transformed cell expresses a recombinant 
form of a mammalian DEF polypeptide or, in the case of anti-sense expression 
from the transferred gene, the expression of a naturally-occurring form of the 

30 DEF protein is disrupted. 

A "delivery composition" shall mean a targeting means (e.g. a molecule 
that results in higher affinity binding of a gene, protein, polypeptide or peptide to 
a target cell surface and/or increased cellular uptake by a target cell). Examples 
of targeting means include: sterols (e.g. cholesterol), lipids (e.g. a cationic lipid, 

35 virosome or liposome), viruses (e.g. adenovirus, adeno-associated virus, and 

retrovirus) or target cell specific binding agents (e.g. ligands recognized by target 
cell specific receptors). 



i 
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III. Polypeptides 

The present invention further pertains to isolated and/or recombinant 
forms of a DEF polypeptide. The terms "protein", "polypeptide" and "peptide" 
5 are used interchangably herein. 

The term "recombinant protein" refers to a polypeptide of the present 
invention which is produced by recombinant DNA techniques, wherein 
generally, DNA encoding a mammalian DEF polypeptide is inserted into a 
suitable expression vector which is in turn used to transform a host cell to 

1 0 produce the heterologous protein. Moreover, the phrase "derived from", with 
respect to a recombinant DEF gene, is meant to include within the meaning of 
"recombinant protein" those proteins having an amino acid sequence of a native 
DEF protein, or a similar amino acid sequence which is generated by mutations 
including substitutions and deletions (including truncation) of a naturally 

1 5 occurring form of the protein. 

The present invention also makes available isolated DEF polypeptides 
which are isolated from, or otherwise substantially free from other cellular 
proteins, especially other factors which may normally be associated with the 
DEF polypeptide. The term "substantially free of other cellular proteins" (also 

20 referred to herein as "contaminating proteins") or "substantially pure or purified 
preparations" are defined as encompassing preparations of DEF polypeptides 
having less than about 20% (by dry weight) contaminating protein, and 
preferably having less than about 5% contaminating protein. Functional forms of 
the subject polypeptides can be prepared, for the first time, as purified 

25 preparations by using a cloned gene as described herein. By "purified", it is 

meant, when referring to a peptide or DNA or RNA sequence, that the indicated 
molecule is present in the substantial absence of other biological 
macromolecules, such as other proteins. The term "purified" as used herein 
preferably means at least 80% by dry weight, more preferably in the range of 95- 

30 99% by weight, and most preferably at least 99.8% by weight, of biological 
macromolecules of the same type present (but water, buffers, and other small 
molecules, especially molecules having a molecular weight of less than 5000, 
can be present). The term "pure" as used herein preferably has the same 
numerical limits as "purified" immediately above. "Isolated" and "purified" are 

35 not meant to encompass either natural materials in their native state or natural 
materials that have been separated into components (e.g., in an acrylamide gel) 
but not obtained either as pure (e.g. lacking contaminating proteins, or 
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chromatography reagents such as denaturing agents and polymers, e.g. 
acrylamide or agarose) substances or solutions. In preferred embodiments, 
purified DEF preparations will lack any contaminating proteins from the same 
animal from which DEF is normally produced, as can be accomplished by 
5 recombinant expression of, for example, a human DEF protein in a non-human 
cell. 

In a particularly preferred embodiment a DEF protein includes the amino 
acid sequence shown in SEQ ID NO:2, SEQ ID NO: 4, SEQ ID NO: 7, or SEQ 
ID NO: 10. In particularly preferred embodiments, a DEF protein has a DEF 
10 bioactivity. 

The present invention also provides for DEF proteins which have amino 
acid sequences evolutionarily related to the DEF proteins represented in SEQ ID 
NO:2, SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10. In a preferred 
embodiment, a DEF protein of the present invention is a mammalian DEF 

1 5 protein. The term "evolutionarily related to", with respect to amino acid 
sequences of mammalian DEF proteins, refers to both polypeptides having 
amino acid sequences which have arisen naturally, and also to mutational 
variants of mammalian DEF polypeptides which are derived, for example, by 
combinatorial mutagenesis. Such evolutionarily derived DEF polypeptides 

20 preferred by the present invention have a DEF bioactivity and are at least 90% 
homologous and most preferably at least 95% homologous with the amino acid 
sequence shown in SEQ ID NO:2, SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID 
NO: 10. 

In certain embodiments it will be advantageous to provide homologs of 
25 one of the subject DEF polypeptides which function in a limited capacity as one 
of either a DEF agonist (mimetic) or a DEF antagonist, in order to promote or 
inhibit only a subset of the biological activities of the naturally-occuning form of 
the protein. Thus, specific biological effects can be elicited by treatment with a 
homolog of limited function, and with fewer side effects relative to treatment 
30 with agonists or antagonists which are directed to all of the biological activities 
of naturally occurring forms of DEF proteins. 

Homologs of each of the subject DEF proteins can be generated by 
mutagenesis, such as by discrete point mutation(s), or by truncation. For 
instance, mutation can give rise to homologs which retain substantially the same, 
35 or merely a subset, of the biological activity of the DEF polypeptide from which 
it was derived. Alternatively, antagonistic forms of the protein can be generated 
which are able to inhibit the function of the naturally occurring form of the 
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protein, such as by competitively binding to a downstream or upstream member 
of the DEF cascade which includes the DEF protein. In addition, agonistic forms 
of the protein may be generated which are constitutively active. Thus, the 
mammalian DEF protein and homologs thereof provided by the subject invention 
5 may be either positive or negative regulators of cell proliferation or 
differentiation. 

The recombinant DEF polypeptides of the present invention also include 
homologs of the wild type DEF proteins, such as versions of those protein which 
are resistant to proteolytic cleavage, as for example, due to mutations which alter 

1 0 ubiquitination or other enzymatic targeting associated with the protein. 

DEF polypeptides may also be chemically modified to create DEF 
derivatives by forming covalent or aggregate conjugates with other chemical 
moieties, such as glycosyl groups, lipids, phosphate, acetyl groups and the like. 
Covalent derivatives of DEF proteins can be prepared by linking the chemical 

1 5 moieties to functional groups on amino acid sidechains of the protein or at the N- 
terminus or at the C-terminus of the polypeptide. 

Modification of the structure of the subject mammalian DFF 
polypeptides can be for such purposes as enhancing therapeutic or prophylactic 
efficacy, stability (e.g., ex vivo shelf life and resistance to proteolytic degradation 

20 in vrvo), or post-translational modifications (e.g., to alter phosphorylation pattern 
of protein). Such modified peptides, when designed to retain at least one activity 
of the naturally-occurring form of the protein, or to produce specific antagonists 
thereof, are considered functional equivalents of the DEF polypeptides described 
in more detail herein. Such modified peptides can be produced, for instance, by 

25 amino acid substitution, deletion, or addition. 

For example, it is reasonable to expect that an isolated replacement of a 
leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine 
with a serine, or a similar replacement of an amino acid with a structurally 
related amino acid (i.e. isosteric and/or isoelectric mutations) will not have a 

30 major effect on the biological activity of the resulting molecule. Conservative 
replacements are those that take place within a family of amino acids that are 
related in their side chains. Genetically encoded amino acids are can be divided 
into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, 
histidine; (3) nonpolar = alanine, valine, leucine, isoleucine, proline, 

35 phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, 

asparagine, glutamine, cysteine, serine, threonine, tyrosine. In similar fashion, 
the amino acid repertoire can be grouped as (1) acidic = aspartate, glutamate; (2) 
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basic = lysine, arginine histidine, (3) aliphatic - glycine, alanine, valine, leucine, 
isoleucine, serine, threonine, with serine and threonine optionally be grouped 
separately as aliphatic-hydroxyl; (4) aromatic = phenylalanine, tyrosine, 
tryptophan; (5) amide = asparagine, glutamine; and (6) sulfur -containing = 
5 cysteine and methionine, (see, for example, Biochemistry, 2nd ed., Ed. by L. 
Stryer, WH Freeman and Co.: 1981). Whether a change in the amino acid 
sequence of a peptide results in a functional DEF homolog (e.g. functional in the 
sense that the resulting polypeptide mimics or antagonizes the wild-type form) 
can be readily determined by assessing the ability of the variant peptide to 

1 0 produce a response in cells in a fashion similar to the wild-type protein, or 
competitively inhibit such a response. Polypeptides in which more than one 
replacement has taken place can readily be tested in the same manner. 

In another embodiment a DEF has a DEF bioactivity and is encoded by 
the nucleic acid shown in Figure 2 (SEQ. ID NO:l), Figure 13 (SEQ ID NO: 3 or 

15 SEQ ID NO: 5), Figure 14 (SEQ ID NO: 6 or SEQ ID NO: 8), or Figure 1 5 (SEQ 
ID NO: 9 or SEQ ID NO: 11). 

Full length proteins or fragments corresponding to one or more particular 
motifs and/or domains or to arbitrary sizes, for example, at least amino acids in 
length are within the scope of the present invention. For example, isolated DEF 

20 polypeptides can include all or a portion of an amino acid sequences 

corresponding to a DEF polypeptide represented in or homologous to Figure 3 
(SEQ ID NO:2), Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). 
Isolated peptidyl portions of DEF proteins can be obtained by screening peptides 
recombinantly produced from the corresponding fragment of the nucleic acid 

25 encoding such peptides. In addition, fragments can be chemically synthesized 
using techniques known in the art such as conventional Merrifield solid phase f- 
Moc or t-Boc chemistry. For example, a DEF polypeptide of the present 
invention may be arbitrarily divided into fragments of desired length with no 
overlap of the fragments, or preferably divided into overlapping fragments of a 

30 desired length. The fragments can be produced (recombinantly or by chemical 
synthesis) and tested to identify those peptidyl fragments which can function as 
either agonists or antagonists of a wild-type (e.g., "authentic") DEF protein. 

In still a further embodiment an isolated or recombinant DEF polypeptide 
includes a sequence corresponding to a src SH3 consensus binding sequence 

35 (794-799, 803-809, 829-835, 895-901 or 993-999 of SEQ ID NO:2; 829-833, 

892-898 or 1005-101 1 of SEQ ID NO: 4; 777-782, 822-828 of SEQ ID NO: 7; or 
780-785, 829-834, 834-840, 867-873 of SEQ ID NO: 10), and is at least 85%, 
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more preferably about 90%, and most preferably at least about 91, 92, 93, 94, 95, 
96, 97, 98, 99% identical to a a src SH3 consensus binding sequence of the 
amino acid sequence shown in Figure 3 (SEQ ID NO:2), Figure 12 (SEQ ID NO: 
4, SEQ ID NO: 7, or SEQ ID NO: 10). 
5 In still a further embodiment an isolated or recombinant DEF polypeptide 

includes a sequence corresponding to a zinc finger domain (457-480 of SEQ ID 
NO:2; 454-477 of SEQ ID NO: 4, 436-459 of SEQ ID NO: 7, or 436-459 of SEQ 
ID NO: 10) and is at least 85%, more preferably about 90%, and most preferably 
at least about 91, 92, 93, 94, 95, 96, 97, 98, 99% identical to a zinc finger domain 

10 of the amino acid sequence shown in Figure 3 (SEQ ID NO:2), Figure 12 (SEQ 
ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). 

In yet another embodiment an isolated or recombinant DEF polypeptide 
includes a sequence corresponding to an ankyrin repeat (356-374, 604-623, 640- 
659 and 672-692 of SEQ ID NO:2; 353-371, 601-620, 637-656 and 669-689 of 

15 SEQ ID NO: 4; 334-352, 585-604, 621-640 and 653-673 of SEQ ID NO: 7; or 
334-352, 584-603, 620-639 and 652-672 of SEQ ID NO: 10) and is at least 85%, 
more preferably about 90%, and most preferably at least about 91, 92, 93, 94, 95, 
96, 97, 9S, 99% identical to an ankyrin repeat of the amino acid sequence shown 
in Figure 3 (SEQ ID NO:2), Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ 

20 ID NO: 10). 

In yet another embodiment an isolated or recombinant DEF polypeptide 
includes a sequence corresponding to a pleckstrin homology domain (326-419 of 
SEQ ID NO:2; 323-416 of SEQ ID NO: 4; 304-397 of SEQ ID NO: 7; or 303- 
397 of SEQ ID NO: 10) and is at least 85%, more preferably about 90%, and 

25 most preferably at least about 91 , 92, 93, 94, 95, 96, 97, 98, 99% identical to an 
ankyrin repeat of the amino acid sequence shown in Figure 3 (SEQ ID NO:2), 
Figure 12 (SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). 

In yet another embodiment an isolated or recombinant DEF polypeptide 
includes a sequence corresponding to a C2 domain (498-557 of SEQ ID NO:2; 

30 495-554 of SEQ ID NO: 4; 477-537 of SEQ ID NO: 7; or 477-536 of SEQ ID 
NO: 10) and is at least 85%, more preferably about 90%, and most preferably at 
least about 91, 92, 93, 94, 95, 96, 97, 98, 99% identical to an ankyrin repeat of 
the amino acid sequence shown in Figure 3 (SEQ ID NO:2), Figure 12 (SEQ ID 
NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10). 

35 In another embodiment an isolated or recombinant DEF polypeptide 

includes a sequence corresponding to a proline-rich repeat (934-1001 of SEQ ID 
NO:2; or 944-1013 of SEQ ID NO: 4) and is at least 85%, more preferably about 
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90%, and most preferably at least about 91, 92, 93, 94, 95, 96, 97, 98, 99% 
identical to a proline-rich repeat of the amino acid sequence shown in Figure 3 
(SEQ ID NO:2), Figure 12 (SEQ ID NO: 4). 

In yet another embodiment an isolated or recombinant DEF polypeptide 
5 includes a sequence corresponding to an SH3 domain ( 1 073- 1 1 23 of SEQ ID 
NO:2; 1095-1 145 of SEQ ID NO: 4; or 926-976 of SEQ ID NO: 7) and is at least 
85%, more preferably about 90%, and most preferably at least about 91, 92, 93, 
94, 95, 96, 97, 98, 99% identical to an SH3 domain of the amino acid sequence 
shown in Figure 3 (SEQ ID NO:2), Figure 12 (SEQ ID NO: 4, or SEQ ID NO: 
10 7). 

In certain preferred embodiments, the invention features a purified or 
recombinant DEF polypeptide having a molecular weight of approximately 135- 
145kD. It will be understood that certain post-translational modifications can 
increase the apparent molecular weight of the DEF protein relative to the 

1 5 unmodified polypeptide chain. 

This invention further provides a method for generating sets of 
combinatorial mutants of the subject DEF proteins as well as truncation mutants, 
and is especially useful for identifying potential variant sequences (e.g. 
homologs) that modulate a DEF bioactivity. The purpose of screening such 

20 combinatorial libraries is to generate, for example, novel DEF homologs which 
can act as either agonists or antagonist, or alternatively, possess novel activities 
all together. To illustrate, combinatorially-derived homologs can be generated to 
have an increased potency relative to a naturally occurring form of the protein. 
Likewise, DEF homologs can be generated by the present combinatorial 

25 approach to selectively inhibit (antagonize) an authentic DEF. For instance, 
mutagenesis can provide DEF homologs which are able to bind other signal 
pathway proteins (or DNA) yet prevent propagation of the signal, e.g. the 
homologs can be dominant negative mutants. Moreover, manipulation of certain 
domains of DEF by the present method can provide domains more suitable for 

30 use in fusion proteins. 

In one embodiment, the variegated library of DEF variants is generated 
by combinatorial mutagenesis at the nucleic acid level, and is encoded by a 
variegated gene library. For instance, a mixture of synthetic oligonucleotides can 
be enzymatically ligated into gene sequences such that the degenerate set of 

35 potential DEF sequences are expressible as individual polypeptides, or 

alternatively, as a set of larger fusion proteins (e.g. for phage display) containing 
the set of DEF sequences therein. 
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There are many ways by which such libraries of potential DEF homologs 
can be generated from a degenerate oligonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be carried out in an automatic DNA 
synthesizer, and the synthetic genes then ligated into an appropriate expression 
5 vector. The purpose of a degenerate set of genes is to provide, in one mixture, all 
of the sequences encoding the desired set of potential DEF sequences. The 
synthesis of degenerate oligonucleotides is well known in the art (see for 
example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) 
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG 

1 0 Walton, Amsterdam: Elsevier pp273-289; Itakura et al. ( 1 984) Annu. Rev. 
Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) 
Nucleic Acid Res. 1 1 'All. Such techniques have been employed in the directed 
evolution of other proteins (see, for example, Scott et al. (1990) Science 
249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) 

15 Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. 
Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

Likewise, a library of coding sequence fragments can be provided for a 
DEF clone in order to generate a variegated population of DEF fragments for 
screening and subsequent selection of bioactive fragments. A variety of 

20 techniques are known in the art for generating such libraries, including chemical 
synthesis. In one embodiment, a library of coding sequence fragments can be 
generated by (i) treating a double stranded PCR fragment of a DEF coding 
sequence with a nuclease under conditions wherein nicking occurs only about 
once per molecule; (ii) denaturing the double stranded DNA; (iii) renaturing the 

25 DNA to form double stranded DNA which can include sense/anti sense pairs 
from different nicked products; (iv) removing single stranded portions from 
reformed duplexes by treatment with SI nuclease; and (v) ligating the resulting 
fragment library into an expression vector. By this exemplary method, an 
expression library can be derived which codes for N-terminal, C-terminal and 

30 internal fragments of various sizes. 

A wide range of techniques are known in the art for screening gene 
products of combinatorial libraries made by point mutations or truncation, and 
for screening cDNA libraries for gene products having a certain property. Such 
techniques will be generally adaptable for riapid screening of the gene libraries 

35 generated by the combinatorial mutagenesis of DEF homologs. The most widely 
used techniques for screening large gene libraries typically comprises cloning the 
gene library into replicable expression vectors, transforming appropriate cells 
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with the resulting library of vectors, and expressing the combinatorial genes 
under conditions in which detection of a desired activity facilitates relatively 
easy isolation of the vector encoding the gene whose product was detected. Each 
of the illustrative assays described below are amenable to high through-put 
5 analysis as necessary to screen large numbers of degenerate DEF sequences 
created by combinatorial mutagenesis techniques. 

In one embodiment, cell based assays can be exploited to analyze the 
variegated DEF library. For instance, the library of expression vectors can be 
transfected into a cell line ordinarily responsive to DEF. The transfected cells 

10 are then exposed to an extracellular signal and the effect of the DEF mutant can 
be detected, e.g. G protein activity, e.g., GTPase activity. Plasmid DNA can 
then be recovered from the cells which score for inhibition, or alternatively, 
potentiation of DEF activity, and the individual clones further characterized. 

Combinatorial mutagenesis has a potential to generate very large libraries 

15 of mutant proteins, e.g., in the order of 10^6 molecules. Combinatorial libraries 
of this size may be technically challenging to screen even with high throughput 
screening assays. To overcome this problem, a new technique has been 
developed recently, recrusive ensemble mutagenesis (REM), which allows one to 
avoid the very high proportion of non-functional proteins in a random library and 

20 simply enhances the frequency of functional proteins, thus decreasing the 

complexity required to achieve a useful sampling of sequence space. REM is an 
algorithm which enhances the frequency of functional mutants in a library when 
an appropriate selection or screening method is employed (Arkin and Yourvan, 
1992, PNAS USA 89:781 1-7815; Yourvan et al., 1992, Parallel Problem Solving 

25 from Nature, 2., In Maenner and Manderick, eds., Elsevir Publishing Co., 

Amsterdam, pp. 401-410; Delgrave et al., 1993, Protein Engineering 6(3):327- 
331). 

The invention also provides for reduction of the mammalian DEF 
proteins to generate mimetics, e.g. peptide or non-peptide agents, which are able 

30 to disrupt binding of a mammalian DEF polypeptide of the present invention 
with binding proteins or interactors. Thus, such mutagenic techniques as 
described above are also useful to map the determinants of the DEF proteins 
which participate in protein-protein interactions involved in, for example, 
binding of the subject mammalian DEF polypeptide to proteins which may 

35 function upstream (including both activators and repressors of its activity) or to 
proteins or nucleic acids which may function downstream of the DEF 
polypeptide, whether they are positively or negatively regulated by it To 
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illustrate, the critical residues of a subject DEF polypeptide which are involved 
in molecular recognition of interactor proteins or molecules upstream or 
downstream of a DEF (such as, for example, a src SH3 binding site, a zinc finger 
domain, an ankyrin repeat) can be determined and used to generate DEF-derived 
5 peptidomimetics which competitively inhibit binding of the authentic DEF 
protein to that moiety. By employing, for example, scanning mutagenesis to 
map the amino acid residues of each of the subject DEF proteins which are 
involved in binding other intracellular proteins, peptidomimetic compounds can 
be generated which mimic those residues of the DEF protein which facilitate the 

10 interaction. Such mimetics may then be used to interfere with the normal 

function of a DEF protein. For instance, non-hydrolyzable peptide analogs of 
such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and 

15 Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), 
substituted y lactam rings (Garvey et al. in Peptides: Chemistry and Biology, 
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto- 
methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and 
Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th 

20 American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), b-turn 
dipeptide cores (Nagai et al. (1 985) Tetrahedron Lett 26:647; and Sato et al. 
(1986) J Chem Soc Perkin Trans 1 :1231), and b-aminoalcohols (Gordon et al. 
(1985) Biochem Biophys Res Communl26:419; and Dann et al. (1986) Biochem 
Biophys Res Commun 1 34:7 1 ). 

25 In another embodiment, the coding sequences for the polypeptide can be 

incorporated as a part of a fusion gene including a nucleotide sequence encoding 
a different polypeptide to generate a fusion protein or chimeric protein. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid 
sequence encoding one of the subject mammalian DEF polypeptides with a 

30 second amino acid sequence defining a domain (e.g. polypeptide portion) foreign 
to and not substantially homologous with any domain of one of the mammalian 
DEF proteins. A chimeric protein may present a foreign domain which is found 
(albeit in a different protein) in an organism which also expresses the first 
protein, or it may be an "interspecies", "intergenic", etc. fusion of protein 

35 structures expressed by different kinds of organisms. In general, a fusion protein 
can be represented by the general formula X-DEF-Y, wherein DEF represents a 
portion of the protein which is derived from one of the mammalian DEF 
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proteins, and X and Y are independently absent or represent amino acid 
sequences which are not related to one of the mammalian DEF sequences in an 
organism, including naturally occurring mutants. 

Fusion proteins can also facilitate the expression of proteins, and 
5 accordingly, can be used in the expression of the mammalian DEF polypeptides 
of the present invention. For example, DEF polypeptides can be generated as 
glutathione-S-transferase (GST-fusion) proteins. Such GST-fusion proteins can 
enable easy purification of the DEF polypeptide, as for example by the use of 
glutathione-derivatized matrices (see, for example, Current Protocols in 

10 Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 1991)). 

In another embodiment, a fusion gene coding for a purification leader 
sequence, such as a poly-(His)/enterokinase cleavage site sequence at the N- 
terminus of the desired portion of the recombinant protein, can allow purification 
of the expressed fusion protein by affinity chromatography using a Ni2+ metal 

1 5 resin. The purification leader sequence can then be subsequently removed by 
treatment with enterokinase to provide the purified protein (e.g., see Hochuli et 
al. (1987) J. Chromatography 41 1:177; and Janknecht et al. PNAS 88:8972). 

Techniques for making fusion genes are known to those skilled in the art. 
Essentially, the joining of various DNA fragments coding for different 

20 polypeptide sequences is performed in accordance with conventional techniques, 
employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as 
appropriate, alkaline phosphatase treatment to avoid undesirable joining, and 
enzymatic ligation. In another embodiment, the fusion gene can be synthesized 

25 by conventional techniques including automated DNA synthesizers. 

Alternatively, PCR amplification of gene fragments can be carried out using 
anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed to generate a 
chimeric gene sequence (see, for example, Current Protocols in Molecular 

30 Biology, eds. Ausubel et al. John Wiley & Sons: 1992). 

In preferred embodiments, fusion proteins of the present invention 
contain a detectable label or a matrix binding domain. 

The preparation of fusion proteins is often desirable when producing an 
immunogenic fragment of a DEF protein. For example, the VP6 capsid protein 

35 of rotavirus can be used as an immunologic carrier protein for portions of the 
DEF polypeptide, either in the monomeric form or in the form of a viral particle. 
The nucleic acid sequences corresponding to the portion of a subject DEF protein 
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to which antibodies are to be raised can be incorporated into a fusion gene 
construct which includes coding sequences for a late vaccinia virus structural 
protein to produce a set of recombinant viruses expressing fusion proteins 
comprising DEF epitopes as part of the virion. It has been demonstrated with the 
5 use of immunogenic fusion proteins utilizing the Hepatitis B surface antigen 
fusion proteins that recombinant Hepatitis B virions can be utilized in this role as 
well. Similarly, chimeric constructs coding for fusion proteins containing a 
portion of a DEF protein and the poliovirus capsid protein can be created to 
enhance immunogenicity of the set of polypeptide antigens (see, for example, EP 

10 Publication No: 0259149; and Evans et al. (1989) Nature 339:385; Huang et al. 
(1988) J. Virol. 62:3855; and Schlienger et al. (1992) J. Virol. 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization 
can also be utilized to generate an immunogen, wherein a desired portion of a 
DEF polypeptide is obtained directly from organo-chemical synthesis of the 

1 5 peptide onto an oligomeric branching lysine core (see, for example, Posnett et al. 
(1988) JBC 263:1719 and Nardelli et al. (1992) J. Immunol. 148:914). 
Antigenic determinants of DEF proteins can also be expressed and presented by 
bacterial cells. 

20 IV. Antibodies 

Another aspect of the invention pertains to an antibody specifically 
reactive with a mammalian DEF protein. For example, by using immunogens 
derived from a DEF protein, e.g. based on the cDNA sequences, anti- 
protein/anti-peptide antisera or monoclonal antibodies can be made by standard 

25 protocols (See, for example, Antibodies: A Laboratory Manual ed. by Harlow 
and Lane (Cold Spring Harbor Press: 1988)). A mammal, such as a mouse, a 
hamster or rabbit can be immunized with an immunogenic form of the peptide 
(e.g., a mammalian DEF polypeptide or an antigenic fragment which is capable 
of eliciting an antibody response, or a fusion protein as described above). 

30 Techniques for conferring immunogenicity on a protein or peptide include 
conjugation to carriers or other techniques well known in the art. An 
immunogenic portion of a DEF protein can be administered in the presence of 
adjuvant The progress of immunization can be monitored by detection of 
antibody titers in plasma or serum. Standard ELISA or other immunoassays can 

35 be used with the immunogen as antigen to assess the levels of antibodies. In a 
preferred embodiment, the subject antibodies are immunospecific for antigenic 
determinants of a DEF protein of a mammal, e.g. antigenic determinants of a 
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protein represented by Figure 3 (SEQ ID NO:2), Figure 12 (SEQ ID NO: 4, SEQ 
ID NO: 7, or SEQ ID NO: 10). 

Following immunization of an animal with an antigenic preparation of a 
DEF polypeptide, anti-DEF antisera can be obtained and, if desired, polyclonal 
5 anti-DEF antibodies isolated from the serum. To produce monoclonal 

antibodies, antibody-producing cells (lymphocytes) can be harvested from an 
immunized animal and fused by standard somatic cell fusion procedures with 
immortalizing cells such as myeloma cells to yield hybridoma cells. Such 
techniques are well known in the art, an include, for example, the hybridoma 

10 technique (originally developed by Kohler and Milstein, (1975) Nature, 256: 
495-497), the human B cell hybridoma technique (Kozbar et ah, (1983) 
Immunology Today, 4: 72), and the EBV-hybridoma technique to produce 
human monoclonal antibodies (Cole et al., (1 985) Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc. pp. 77-96). Hybridoma cells can be screened 

15 immunochemically for production of antibodies specifically reactive with a 

mammalian DEF polypeptide of the present invention and monoclonal antibodies 
isolated from a culture comprising such hybridoma cells. 

The term antibody as used herein is intended to include fragments thereof 
which are also specifically reactive with one of the subject mammalian DEF 

20 polypeptides. Antibodies can be fragmented using conventional techniques and 
the fragments screened for utility in the same manner as described above for 
whole antibodies. For example, F(ab)2 fragments can be generated by treating 
antibody with pepsin. The resulting F(ab)2 fragment can be treated to reduce 
disulfide bridges to produce Fab fragments. The antibody of the present 

25 invention is further intended to include bispecific and chimeric molecules having 
affinity for a DEF protein conferred by at least one CDR region of the antibody. 

Antibodies which specifically bind DEF epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the 
abundance and pattern of expression of each of the subject DEF polypeptides. 

30 Anti-DEF antibodies can be used diagnostically in immuno-precipitation and 
immuno-blotting to detect and evaluate DEF protein levels in tissue as part of a 
clinical testing procedure. Likewise, the ability to monitor DEF protein levels in 
an individual can allow determination of the efficacy of a given treatment 
regimen for an individual afflicted with such a disorder. Diagnostic assays using 

35 anti-DEF antibodies can include, for example, immunoassays designed to aid in 
early diagnosis of a degenerative disorder, particularly ones which are manifest 
at birth. Diagnostic assays using anti-DEF polypeptide antibodies can also 
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include immunoassays designed to aid in early diagnosis and phenotyping 
neoplastic or hyperplastic disorders. 

Another application of anti-DEF antibodies of the present invention is in 
the immunological screening of cDNA libraries constructed in expression vectors 
5 such as Xgtl 1, A.gtl8-23, ^ZAP, and A.ORF8. Messenger libraries of this type, 
having coding sequences inserted in the correct reading frame and orientation, 
can produce fusion proteins. For instance, A,gtl 1 will produce fusion proteins 
whose amino termini consist of fi-galactosidase amino acid sequences and whose 
carboxy termini consist of a foreign polypeptide. Antigenic epitopes of a DEF 

10 protein, e.g. other orthologs of a particular DEF protein or other paralogs from 
the same species, can then be detected with antibodies, as, for example, reacting 
nitrocellulose filters lifted from infected plates with anti-DEF antibodies. 
Positive phage detected by this assay can then be isolated from the infected plate. 
Thus, the presence of DEF homologs can be detected and cloned from other 

15 animals, as can alternate isoforms (including splicing variants) from humans. 

In certain embodiment, it will be desirable to attach a label group to the 
subject antibodies to facilitate detection. One means for labeling an anti-DEF 
protein specific antibody is via linkage to an enzyme and use in an enzyme 
immunoassay (EI A) (Voller, "The Enzyme Linked Immunosorbent Assay 

20 (ELISA)", Diagnostic Horizons 2: 1 -7, 1 978, Microbiological Associates 

Quarterly Publication, Walkersville, MD; Voller, et aL, J. Clin. Pathol 31 :507- 
520 (1978); Butler, Meth. Enzymol. 73:482-523 (1981); Maggio, (ed.) Enzyme 
Immunoassay, CRC Press, Boca Raton, FL, 1980; Ishikawa, et aL, (eds!) 
Enzyme Immunoassay, Kgaku Shoin, Tokyo, 1981). The enzyme which is 

25 bound to the antibody will react with an appropriate substrate, preferably a 

chromogenic substrate, in such a manner as to produce a chemical moiety which 
can be detected, for example, by spectrophotometry, fluorimetric or by visual 
means. Enzymes which can be used to detectably label the antibody include, but 
are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5- 

30 steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate, 
dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline 
phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, 
urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase and 
acetylcholinesterase. The detection can be accomplished by colorimetric 

35 methods which employ a chromogenic substrate for the enzyme. Detection may 
also be accomplished by visual comparison of the extent of enzymatic reaction of 
a substrate in comparison with similarly prepared standards. 
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Detection may also be accomplished using any of a variety of other 
immunoassays. For example, by radioactively labeling the antibodies or 
antibody fragments, it is possible to detect fingerprint gene wild type or mutant 
peptides through the use of a radioimmunoassay (RIA) (see, for example, 
5 Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on 
Radioligand Assay Techniques, The Endocrine Society, March, 1986, which is 
incorporated by reference herein). The radioactive isotope can be detected by 
such means as the use of a y counter or a scintillation counter or by 
autoradiography. 

10 It is also possible to label the antibody with a fluorescent compound. 

When the fluorescently labeled antibody is exposed to light of the proper wave 
length, its presence can then be detected. Among the most commonly used 
fluorescent labeling compounds are fluorescein isothiocyanate, rhodamine, 
phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and 

15 fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting 
metals such as 152Eu, or others of the lanthanide series. These metals can be 
attached to the antibody using such metal chelating groups as 
diethylenetriaminepentacetic acid (DTP A) or ethylenediaminetetraacetic acid 

20 (EDTA). 

The antibody also can be detectably labeled by coupling it to a 
chemiluminescent compound. The presence of the chemiluminescent-tagged 
antibody is then determined by detecting luminescence that arises during the 
course of a chemical reaction. Examples of particularly useful chemiluminescent 

25 labeling compounds are luminol, isoluminol, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label the antibody 
of the present invention. Bioluminescence is a type of chemiluminescence found 
in biological systems in, which a catalytic protein increases the efficiency of the 

30 chemiluminescent reaction. The presence of a bioluminescent protein is 
determined by detecting the presence of luminescence. Important 
bioluminescent compounds for purposes of labeling are luciferin, luciferase and 
aequorin. 



35 



V. Pharmaceutical Preparations 

The subject modulating agents can be administered to a subject at 
therapeutically effective dose to treat or ameliorate a disorder benefiting from the 
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studies can be used in formulating a range of dosage for use in humans. The 
dosage of such compounds lies preferably within a range of circulating or tissue 
concentrations that include the ED50 with little or no toxicity. The dosage may 
5 vary within this range depending upon the dosage form employed and the route 
of administration utilized. For any compound used in the method of the 
invention, the therapeutically effective dose can be estimated initially from cell 
culture assays. A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the IC50 (i.e., the 

1 0 concentration of the test compound which achieves a half-maximal inhibition of 
symptoms) as determined in cell culture. Such information can be used to more 
accurately determine useful doses in humans. Levels in plasma may be 
measured, for example, by high performance liquid chromatography. 

In clinical settings, the gene delivery systems for the therapeutic DEF 

1 5 gene can be introduced into a patient by any of a number of methods, each of 
which is familiar in the art. For instance, a pharmaceutical preparation of the 
gene delivery system can be introduced systemically, e.g. by intravenous 
injection, and specific transduction of the protein in the target cells occurs 
predominantly from specificity of transfection provided by the gene delivery 

20 vehicle, cell-type or tissue-type expression due to the transcriptional regulatory 
sequences controlling expression of the receptor gene, or a combination thereof. 
In other embodiments, initial delivery of the recombinant gene is more limited 
with introduction into the animal being quite localized. For example, the gene 
delivery vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by 

25 stereotactic injection (e.g. Chen et al. (1994) PNAS 91 : 3054-3057). A 

mammalian DEF gene, such as any one of the sequences represented in SEQ ID 
NO: 1 , or a sequence homologous thereto can be delivered in a gene therapy 
construct by electroporation using techniques described, for example, by Dev et 
al. ((1994) Cancer Treat Rev 20:105-1 15). 

30 The pharmaceutical preparation of the gene therapy construct can consist 

essentially of the gene delivery system in an acceptable diluent, or can comprise 
a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery system can be produced intact 
from recombinant cells, e.g. retroviral vectors, the pharmaceutical preparation 

35 can comprise one or more cells which produce the gene delivery system. 

Pharmaceutical preparations for use in accordance with the present 
invention may also be formulated in conventional manner using one or more 
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physiologically acceptable carriers or excipients. Thus, the compounds and their 
physiologically acceptable salts and solvates may be formulated for 
administration by, for example, injection, inhalation or insufflation (either 
through the mouth or the nose) or oral, buccal, parenteral or rectal 
5 administration. 

For such therapy, the compounds of the invention can be formulated for a 
variety of loads of administration, including systemic and topical or localized 
administration. Techniques and formulations generally may be found in 
Remmington's Pharmaceutical Sciences, Meade Publishing Co., Easton, PA. For 

1 0 systemic administration, injection is preferred, including intramuscular, 

intravenous, intraperitoneal, and subcutaneous. For injection, the oligomers of 
the invention can be formulated in liquid solutions, preferably in physiologically 
compatible buffers such as Hank's solution or Ringer's solution. In addition, the 
oligomers may be formulated in solid form and redissolved or suspended 

1 5 immediately prior to use. Lyophilized forms are also included. 

For oral administration, the pharmaceutical preparations may take the 
form of, for example, tablets or capsules prepared by conventional means with 
pharmaceutical^ acceptable excipients such as binding agents (e.g., 
pregelatinised maize starch, polyvinylpyrrolidone or hydroxypropyl 

20 methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium 
hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); 
disintegrants (e.g., potato starch or sodium starch glycolate); or wetting agents 
(e.g., sodium lauryl sulphate). The tablets may be coated by methods well 
known in the art. Liquid preparations for oral administration may take the form 

25 of, for example, solutions, syrups or suspensions, or they may be presented as a 
dry product for constitution with water or other suitable vehicle before use. Such 
liquid preparations may be prepared by conventional means with 
pharmaceutical^ acceptable additives such as suspending agents (e.g., sorbitol 
syrup, cellulose derivatives or hydrogenated edible fats); emulsifying agents 

30 (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily esters, ethyl 
alcohol or fractionated vegetable oils); and preservatives (e.g., methyl or propyl- 
p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as appropriate. Preparations for 
oral administration may be suitably formulated to give controlled release of the 

35 active compound. 

For administration by inhalation, the preparations for use according to the 
present invention are conveniently delivered in the form of an aerosol spray 
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presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to 
5 deliver a metered amount. Capsules and cartridges of e.g. gelatin for use in an 
inhaler or insufflator may be formulated containing a powder mix of the 
compound and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for 

1 0 injection may be presented in unit dosage form, e.g., in ampoules or in multi- 
dose containers, with an added preservative. The compositions may take such 
forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and 
may contain formulatory agents such as suspending, stabilizing and/or dispersing 
agents. Alternatively, the active ingredient may be in powder form for 

1 5 constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository 
bases such as cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may 

20 also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or 
by intramuscular injection. Thus, for example, the compounds may be 
formulated with suitable polymeric or hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble 

25 derivatives, for example, as a sparingly soluble salt 

Systemic administration can also be by transmucosal or transdermal 
means. For transmucosal or transdermal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are 
generally known in the art, and include, for example, for transmucosal 

30 administration bile salts and fiisidic acid derivatives. In addition, detergents may 
be used to facilitate permeation. Transmucosal administration may be through 
nasal sprays or using suppositories. For topical administration, the oligomers of 
the invention are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

35 The compositions may, if desired, be presented in a pack or dispenser 

device, or as a kit with instructions. The composition may contain one or more 
unit dosage forms containing the active ingredient. The pack may for example 



WO 98/36065 



PCT/US98/02724 



-65- 

comprise metal or plastic foil, such as a blister pack. The pack or dispenser 
device may be accompanied by instructions for administration. 

VI. Transgenic animals 

5 The present invention also provides for transgenic animals in which 

expression of a genomic sequence encoding a functional DEF polypeptide is 
enhanced, induced, disrupted, prevented or suppressed. The transgenic animals 
produced in accordance with the present invention will include exogenous 
genetic material. As set out above, the exogenous genetic material will, in certain 

1 0 embodiments^ be a DN A sequence which results in the production of a DEF 
protein (either agonistic or antagonistic), and antisense transcript, or a DEF 
mutant. Further, in such embodiments the sequence will be attached to a 
transcriptional control element, e.g., a promoter, which preferably allows the 
expression of the transgene product in a specific type of cell. 

15 As used herein, the term "transgene" means a nucleic acid sequence 

(whether encoding or antisense to one of the mammalian DEF polypeptides), 
which is partly or entirely heterologous, i.e., foreign, to the transgenic animal or 
cell into which it is introduced, or, is homologous to an endogenous gene of the 
transgenic animal or cell into which it is introduced, but which is designed to be 

20 inserted, or is inserted, into the animal's genome in such a way as to alter the 

genome of the cell into which it is inserted (e.g., it is inserted at a location which 
differs from that of the natural gene or its insertion results in a knockout). A 
transgene can include one or more transcriptional regulatory sequences and any 
other nucleic acid, such as introns, that may be necessary for optimal expression 

25 of a selected nucleic acid. 

A "transgenic animal" refers to any animal, preferably a non-human 
mammal, bird or an amphibian, in which one or more of the cells of the animal 
contain heterologous nucleic acid introduced by way of human intervention, such 
as by transgenic techniques well known in the art. The nucleic acid is introduced 

30 into the cell, directly or indirectly by introduction into a precursor of the cell, by 
way of deliberate genetic manipulation, such as by microinjection or by infection 
with a recombinant virus. The term genetic manipulation does not include 
classical cross-breeding, or in vitro fertilization, but rather is directed to the 
introduction of a recombinant DNA molecule. This molecule may be integrated 

35 within a chromosome, or it may be extrachromosomally replicating DNA. In the 
typical transgenic animals described herein, the transgene causes cells to express 
a recombinant form of one of the mammalian DEF proteins, e.g. either agonistic 
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or antagonistic forms. However, transgenic animals in which the recombinant 
DEF gene is silent are also encompassed, as for example, the FLP or CRE 
recombinase dependent constructs described below. Moreover, "transgenic 
animal" also includes those recombinant animals in which gene disruption of one 
5 or more DEF genes is caused by human intervention, including both 
recombination and antisense techniques. 

The "non-human animals" of the invention include mammalians such as 
rodents, non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, 
etc. Preferred non-human animals are selected from the rodent family including 

10 rat and mouse, most preferably mouse. The term "chimeric animal" is used 
herein to refer to animals in which the recombinant gene is found, or in which 
the recombinant is expressed in some but not all cells of the animal. The term 
"tissue-specific chimeric animal" indicates that one of the recombinant 
mammalian DEF genes is present and/or expressed or disrupted in some tissues 

15 but not others. 

These systems may be used in a variety of applications. For example, the 
cell- and animal-based model systems may be used to further characterize DEF 
genes and proteins. In addition, such assays may be utilized as part of screening 
strategies designed to identify compounds which are capable of ameliorating 

20 disease symptoms. Thus, the animal- and cell-based models may be used to 
identify drugs, pharmaceuticals, therapies and interventions which may be 
effective in treating disease. 

One aspect of the present invention concerns transgenic animals which 
are comprised of cells (of that animal) which contain a transgene of the present 

25 invention and which preferably (though optionally) express an exogenous DEF 
protein in one or more cells in the animal. A DEF transgene can encode the 
wild-type form of the protein, or can encode homologs thereof, including both 
agonists and antagonists, as well as antisense constructs. In preferred 
embodiments, the expression of the transgene is restricted to specific subsets of 

30 cells, tissues or developmental stages utilizing, for example, cis-acting sequences 
that control expression in the desired pattern. In the present invention, such 
mosaic expression of a DEF protein can be essential for many forms of lineage 
analysis and can additionally provide a means to assess the effects of, for 
example, lack of DEF expression which might grossly alter development in small 

35 patches of tissue within an otherwise normal embryo. Toward this end, tissue- 
specific regulatory sequences and conditional regulatory sequences can be used 
to control expression of the transgene in certain spatial patterns. Moreover, 
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temporal patterns of expression can be provided by, for example, conditional 
recombination systems or prokaryotic transcriptional regulatory sequences. 

Genetic techniques which allow for the expression of transgenes can be 
regulated via site-specific genetic manipulation in vivo are known to those 
5 skilled in the art For instance, genetic systems are available which allow for the 
regulated expression of a recombinase that catalyzes the genetic recombination a 
target sequence. As used herein, the phrase "target sequence" refers to a 
nucleotide sequence that is genetically recombined by a recombinase. The target 
sequence is flanked by recombinase recognition sequences and is generally either 

10 excised or inverted in cells expressing recombinase activity. Recombinase 

catalyzed recombination events can be designed such that recombination of the 
target sequence results in either the activation or repression of expression of one 
of the subject DEF proteins. For example, excision of a target sequence which 
interferes with the expression of a recombinant DEF gene, such as one which 

15 encodes an antagonistic homolog or an antisense transcript, can be designed to 
activate expression of that gene. This interference with expression of the protein 
can result from a variety of mechanisms, such as spatial separation of the DEF 
gene from the promoter element or an internal stop codon. Moreover, the 
transgene can be made wherein the coding sequence of the gene is flanked by 

20 recombinase recognition sequences and is initially transfected into cells in a 3' to 
5' orientation with respect to the promoter element. In such an instance, 
inversion of the target sequence will reorient the subject gene by placing the 5' 
end of the coding sequence in an orientation with respect to the promoter element 
which allow for promoter driven transcriptional activation. 

25 The transgenic animals of the present invention all include within a 

plurality of their cells a transgene of the present invention, which transgene alters 
the phenotype of the "host cell" with respect to regulation of cell growth, death 
and/or differentiation. Since it is possible to produce transgenic organisms of the 
invention utilizing one or more of the transgene constructs described herein, a 

30 general description will be given of the production of transgenic organisms by 
referring generally to exogenous genetic material. This general description can 
be adapted by those skilled in the art in order to incorporate specific transgene 
sequences into organisms utilizing the methods and materials described below. 
In an illustrative embodiment, either the cre/loxP recombinase system of 

35 bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) 
PNAS 89:6861-6865) or the FLP recombinase system of Saccharomyces 
cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355; PCT publication WO 
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92/15694) can be used to generate in vivo site-specific genetic recombination 
systems. 

Accordingly, genetic recombination of the target sequence is dependent 
on expression of the Cre recombinase. Expression of the recombinase can be 
5 regulated by promoter elements which are subject to regulatory control, e.g., 
tissue-specific, developmental stage-specific, inducible or repressible by 
externally added agents. This regulated control will result in genetic 
recombination of the target sequence only in cells where recombinase expression 
is mediated by the promoter element Thus, the activation expression of a 
10 recombinant DEF protein can be regulated via control of recombinase 
expression. 

Use of the cre/loxP recombinase system to regulate expression of a 
recombinant DEF protein requires the construction of a transgenic animal 
containing transgenes encoding both the Cre recombinase and the subject 

1 5 protein. Animals containing both the Cre recombinase and a recombinant DEF 
gene can be provided through the construction of "double" transgenic animals. A 
convenient method for providing such animals is to mate two transgenic animals 
each containing a transgene, e.g., a DEF gene and recombinase gene. 

One advantage derived from initially constructing transgenic animals 

20 containing a DEF transgene in a recombinase-mediated expressible format 
derives from the likelihood that the subject protein, whether agonistic or 
antagonistic, can be deleterious upon expression in the transgenic animal. In 
such an instance, a founder population, in which the subject transgene is silent in 
all tissues, can be propagated and maintained. Individuals of this founder 

25 population can be crossed with animals expressing the recombinase in, for 
example, one or more tissues and/or a desired temporal pattern. Thus, the 
creation of a founder population in which, for example, an antagonistic DEF 
transgene is silent will allow the study of progeny from that founder in which 
disruption of DEF mediated induction in a particular tissue or at certain 

30 developmental stages would result in, for example, a lethal phenotype. 

Similar conditional transgenes can be provided using prokaryotic 
promoter sequences which require prokaryotic proteins to be simultaneous 
expressed in order to facilitate expression of the DEF transgene. Exemplary 
promoters and the corresponding trans-activating prokaryotic proteins are given 

35 in U.S. Patent No. 4,833,080. 

Moreover, expression of the conditional transgenes can be induced by 
gene therapy-like methods wherein a gene encoding the trans-activating protein, 
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e.g. a recombinase or a prokaryotic protein, is delivered to the tissue and caused 
to be expressed, such as in a cell-type specific manner. By this method, a DEF 
transgene could remain silent into adulthood until "turned on" by the 
introduction of the trans-activator. 
5 In one embodiment, gene targeting, which is a method of using 

homologous recombination to modify an animal's genome, can be used to 
introduce changes into cultured embryonic stem cells. By targeting a DEF gene 
of interest e.g., in embryonic stem (ES) cells, these changes can be introduced 
into the germlines of animals to generate chimeras. The gene targeting 

1 0 procedure is accomplished by introducing into tissue culture cells a DNA 

targeting construct that includes a segment homologous to a target DEF locus, 
and which also includes an intended sequence modification to the DEF genomic 
sequence (e.g., insertion, deletion, point mutation). The treated cells are then 
screened for accurate targeting to identify and isolate those which have been 

15 properly targeted. 

Methods of culturing cells and preparation of knock out constructs for 
insertion are known to the skilled artisan, such as those set forth by Robertson in: 
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E.J. 
Robertson, ed. IRL Press, Washington, D.C. [1987]); by Bradley et al. (1986) 

20 Current Topics in Devel. BioL 20:357-37 1 ); and by Hogan et al. (Manipulating 
the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY [1986]) . 

Introduction of the transgenic constructs nucleotide sequence into the 
embryo may be accomplished by any means known in the art such as, for 

25 example, microinjection, electroporation, calcium phosphate, or lipofection. 
Retroviral infection can also be used to introduce transgene into a non-human 
animal. The developing non-human embryo can be cultured in vitro to the 
blastocyst stage. During this time, the blastomeres can be targets for retroviral 
infection (Jaenich, R. (1976) PNAS 73:1260-1264). 

30 Other methods of making knock-out or disruption transgenic animals are 

also generally known. See, for example, Manipulating the Mouse Embryo, 
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 
Recombinase dependent knockouts can also be generated, e.g. by homologous 
recombination to insert target sequences, such that tissue specific and/or 

35 temporal control of inactivation of a DEF-gene can be controlled by recombinase 
sequences. 
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Animals containing more than one knockout construct and/or more than 
one transgene expression construct are prepared in any of several ways. The 
preferred manner of preparation is to generate a series of mammals, each 
containing one of the desired transgenic phenotypes. Such animals are bred 
5 together through a series of crosses, backcrosses and selections, to ultimately 
generate a single animal containing all desired knockout constructs and/or 
expression constructs, where the animal is otherwise congenic (genetically 
identical) to the wild type except for the presence of the knockout construct(s) 
and/or transgene(s). 

10 

Uses and Methods of the Invention 
VII, Drug Screening Assays 

The present invention also provides for assays which can be used to 
screen for compounds, including DEF homologs, which are either agonists or 

1 5 antagonists of the normal cellular function of the subject DEF polypeptides, or 
portions thereof such as an SH3 domain or a src SID consensus binding 
sequence. Screened compounds, for example agonist of DEF bioactivity, may be 
useful in treating many diseases involving cell proliferation, e.g., metastasis of 
cancer cells. In other embodiments, antagonists of DEF are provided. 

20 For example, potentiators, or alternatively, inhibitors, of an interaction 

between a src SH3 consensus binding sequence and an interacting protein, e.g., a 
protein containing an SH3 domain, e.g., pp60 5rc . A variety of assay formats can 
be used for the subject assays. An exemplary method includes the steps of (a) 
forming a reaction mixture including: (i) a ppdO^, (ii) a DEF or a src SH3 

25 consensus binding sequence, and (iii) a test compound; and (b) detecting 
interaction of the pp60* rc and a DEF polypeptide or a src SH3 consensus 
binding sequence polypeptides. A statistically significant change (potentiation or 
inhibition) in the interaction of the ppeO^ and a DEF polypeptide or a src SH3 
consensus binding sequence in the presence of the test compound, relative to the 

30 interaction in the absence of the test compound, indicates a potential agonist 
(mimetic or potentiator) or antagonist (inhibitor) of said interaction. The 
reaction mixture can be a cell-free protein preparation, e.g., a reconsistuted 
protein mixture or a cell lysate, or it can be a recombinant cell including a 
heterologous nucleic acid recombinantly expressing the DEF polypeptide. 

35 In one embodiment, an assay is provided for screening for modulators of 

an interaction between a DEF polypeptide or various domains thereof, e.g., SH3 
domain or a src SH3 consensus binding sequence, with signaling molecules. 
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In many drug screening programs which test libraries of compounds and 
natural extracts, high throughput assays are desirable in order to maximize the 
number of compounds surveyed in a given period of time. Assays which are 
performed in cell-free systems, such as may be derived with purified or semi- 
5 purified proteins, are often preferred as "primary" screens in that they can be 
generated to permit rapid development and relatively easy detection of an 
alteration in a molecular target which is mediated by a test compound. 
Moreover, the effects of cellular toxicity and/or bioavailability of the test 
compound can be generally ignored in the in vitro system, the assay instead 
1 0 being focused primarily on the effect of the drug on the molecular target as may 
be manifest in an alteration of binding affinity with upstream or downstream 
elements. 

In an exemplary screening assay of the present invention, the compound 
of interest is contacted with proteins which may function upstream (including 

1 5 both activators and repressors of its activity) or to proteins or nucleic acids which 
may function downstream of the DEF polypeptide, whether they are positively or 
negatively regulated by it. To the mixture of the compound and the upstream or 
downstream element is then added a composition containing a DEF polypeptide. 
Detection and quantification of the interaction of DEF with its upstream or 

20 downstream elements provide a means for determining a compound's efficacy at 
inhibiting (or potentiating) complex formation between DEF and the DEF- 
binding elements. The term "interact" as used herein is meant to include 
detectable interactions between molecules, such as can be detected using, for 
example, a yeast two hybrid assay. The term interact is also meant to include 

25 "binding" interactions between molecules. Interactions may be protein-protein 
or protein-nucleic acid in nature. 

The efficacy of the compound can be assessed by generating dose 
response curves from data obtained using various concentrations of the test 
compound. Moreover, a control assay can also be performed to provide a 

30 baseline for comparison. In the control assay, isolated and purified DEF 

polypeptide is added to a composition containing the DEF-binding element, and 
the formation of a complex is quantitated in the absence of the test compound. 

Complex formation between the DEF polypeptide and a DEF binding 
element may be detected by a variety of techniques. Modulation of the 

35 formation of complexes can be quantitated using, for example, detectably labeled 
proteins such as radiolabeled, fluorescently labeled, or enzymatically labeled 
DEF polypeptides, by immunoassay, or by chromatographic detection. 
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Typically, it will be desirable to immobilize either DEF or its binding 
protein to facilitate separation of complexes from uncomplexed forms of one or 
both of the proteins, as well as to accommodate automation of the assay. 
Binding of DEF to an upstream or downstream element, in the presence and 
5 absence of a candidate agent, can be accomplished in any vessel suitable for 
containing the reactants. Examples include microtitre plates, test tubes, and 
micro-centrifuge tubes. In one embodiment, a fusion protein can be provided 
which adds a domain that allows the protein to be bound to a matrix. For 
example, glutathione-S-transferase/DEF (GST/DEF) fusion proteins can be 

10 adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then combined with the cell 
lysates, e.g. an 35S-labeled, and the test compound, and the mixture incubated 
under conditions conducive to complex formation, e.g. at physiological 
conditions for salt and pH, though slightly more stringent conditions may be 

1 5 desired. Following incubation, the beads are washed to remove any unbound 
label, and the matrix immobilized and radiolabel determined directly (e.g. beads 
placed in scintilant), or in the supernatant after the complexes are subsequently 
dissociated. Alternatively, the complexes can be dissociated from the matrix, 
separated by SDS-PAGE, and the level of DEF-binding protein found in the bead 

20 fraction quantitated from the gel using standard electrophoretic techniques such 
as described in the appended examples. 

Other techniques for immobilizing proteins on matrices are also available 
for use in the subject assay. For instance, either DEF or its cognate binding 
protein can be immobilized utilizing conjugation of biotin and streptavidin. For 

25 instance, biotinylated DEF molecules can be prepared from biotin-NHS (N- 

hydroxy-succinimide) using techniques well known in the art (e.g., biotinylation 
kit, Pierce Chemicals, Rockford, IL), and immobilized in the wells of 
streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies 
reactive with DEF but which do not interfere with binding of upstream or 

30 downstream elements can be derivatized to the wells of the plate, and DEF 

trapped in the wells by antibody conjugation. As above, preparations of a DEF- 
binding protein and a test compound are incubated in the DEF-presenting wells 
of the plate, and the amount of complex trapped in the well can be quantitated. 
Exemplary methods for detecting such complexes, in addition to those described 

35 above for the GST-immobilized complexes, include immunodetection of 

complexes using antibodies reactive with the DEF binding element, or which are 
reactive with DEF protein and compete with the binding element; as well as 
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enzyme-linked assays which rely on detecting an enzymatic activity associated 
with the binding element, either intrinsic or extrinsic activity. In the instance of 
the latter, the enzyme can be chemically conjugated or provided as a fusion 
protein with the DEF-BP. To illustrate, the DEF-BP can be chemically cross- 
5 linked or genetically fused with horseradish peroxidase, and the amount of 

polypeptide trapped in the complex can be assessed with a chromogenic substrate 
of the enzyme, e.g. 3,3'-chamino-benzadine terahydrochloride or 4-chloro-l- 
napthol. Likewise, a fusion protein comprising the polypeptide and glutathione- 
s-transferase can be provided, and complex formation quantitated by detecting 
10 the GST activity using 1 -chloro-2,4-dinitrobenzene (Habig et al (1 974) J Biol 
Chem 249:7130). 

For processes which rely on immunodetection for quantitating one of the 
proteins trapped in the complex, antibodies against the protein, such as anti-DEF 
antibodies, can be used. Alternatively, the protein to be detected in the complex 

1 5 can be "epitope tagged" in the form of a fusion protein which includes, in 
addition to the DEF sequence, a second polypeptide for which antibodies are 
readily available (e.g. from commercial sources). For instance, the GST fusion 
proteins described above can also be used for quantification of binding using 
antibodies against the GST moiety. Other useful epitope tags include myc- 

20 epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21 150-21 157) which 
includes a 10-residue sequence from c-myc, as well as the pFLAG system 
(International Biotechnologies, Inc.) or the pEZZ-protein A system (Pharamacia, 
NJ). 

In addition to cell-free assays, such as described above, the readily 
25 available source of mammalian DEF proteins provided by the present invention 
also facilitates the generation of cell-based assays for identifying small molecule 
agonists/antagonists and the like. For example, cells can be caused to 
overexpress a recombinant DEF protein in the presence and absence of a test 
compound of interest, with the assay scoring for modulation in DEF responses 
30 by the target cell mediated by the test agent. As with the cell-free assays, 

compounds which produce a statistically significant change in DEF-dependent 
responses (either inhibition or potentiation) can be identified. In an illustrative 
embodiment, the expression or activity of a DEF is modulated embryos or cells 
and the effects of compounds of interest on the readout of interest (such as 
35 apoptosis) are measured. For example, the expression of genes which are up- or 
down-regulated in response to a DEF-dependent signal cascade can be assayed. 
In preferred embodiments, the regulatory regions of such genes, e.g., the 5* 
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flanking promoter and enhancer regions, are operatively linked to a marker (such 
as luciferase) which encodes a gene product that can be readily detected. 

Monitoring the influence of compounds on cells may be applied not only 
in basic drug screening, but also in clinical trials. In such clinical trials, the 
5 expression of a panel of genes may be used as a "read out" of a particular drug's 
therapeutic effect. 

In another aspect of the invention, the subject DEF polypeptides can be 
used to generate a "two hybrid" assay (see, for example, U.S. Patent No. 
5,283,3 17; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol 

10 Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; 
Iwabuchi et al. (1993) Oncogene 8:1693-1696; and Brent WO94/10300), for 
isolating coding sequences for other cellular proteins which bind to or interact 
with DEF ("DEF-binding proteins" or M DEF-bp\ Such DEF-binding proteins 
would likely regulators of DEF bioactivity. 

1 5 Briefly, the two hybrid assay relies on reconstituting in vivo a functional 

transcriptional activator protein from two separate fusion proteins. In particular, 
the method makes use of chimeric genes which express hybrid proteins. To 
illustrate, a first hybrid gene comprises the coding sequence for a DNA-binding 
domain of a transcriptional activator fused in frame to the coding sequence for a 

20 DEF polypeptide. The second hybrid protein encodes a transcriptional activation 
domain fused in frame to a sample gene from a cDNA library. If thebait and 
sample hybrid proteins are able to interact, e.g., form a DEF-dependent complex, 
they bring into close proximity the two domains of the transcriptional activator. 
This proximity is sufficient to cause transcription of a reporter gene which is 

25 operatively linked to a transcriptional regulatory site responsive to the 

transcriptional activator, and expression of the reporter gene can be detected and 
used to score for the interaction of the DEF and sample proteins. 

VIII. Diagnostic and Prognostic Assays 

30 The invention provides a method for detecting the presence of DEF in a 

biological sample. The method involves contacting the biological sample with 
an agent capable of detecting DEF protein or mRNA such that the presence of 
DEF is detected in the biological sample. A preferred agent for detecting DEF 
mRNA is a labeled or labelable nucleic acid probe capable of hybridizing to DEF 

35 mRNA. The nucleic acid probe can be, for example, the full-length DEF cDNA 
of SEQ ID NO: 1 , SEQ ID NO: 3 or SEQ ID NO: 5, SEQ ID NO: 6 or SEQ ID 
NO: 8, or SEQ ID NO: 9 or SEQ ID NO: 1 1 , or a portion thereof, such as an 
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oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 
sufficient to specifically hybridize under stringent conditions to DEF mRNA. A 
preferred agent for detecting DEF protein is a labeled or labelable antibody 
capable of binding to DEF protein. Antibodies can be polyclonal, or more 
5 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 
F(ab')2) can be used. The term "labeled or labelable", with regard to the probe or 
antibody, is intended to encompass direct labeling of the probe or antibody by 
coupling (i.e., physically linking) a detectable substance to the probe or antibody, 
as well as indirect labeling of the probe or antibody by reactivity with another 

1 0 reagent that is directly labeled. Examples of indirect labeling include detection 
of a primary antibody using a fluorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it can be detected with 
fluorescently labeled streptavidin. The term "biological sample" is intended to 
include tissues, cells and biological fluids isolated from a subject, as well as 

1 5 tissues, cells and fluids present within a subject. That is, the detection method of 
the invention can be used to detect DEF mRNA or protein in a biological sample 
in vitro as well as in vivo. For example, in vitro techniques for detection of DEF 
mRNA include Northern hybridizations and in situ hybridizations. In vitro 
techniques for detection of DEF protein include enzyme linked immunosorbent 

20 assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. 
Alternatively, DEF protein can be detected in vivo in a subject by introducing 
into the subject a labeled anti-DEF antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can 
be detected by standard imaging techniques. 

25 Accordingly, the invention provides a diagnostic method comprising: 

contacting a sample from a subject with an agent capable of detecting 
DEF protein or mRNA; 

determining the amount of DEF protein or mRNA expressed in the 
sample; 

30 comparing the amount of DEF protein or mRNA expressed in the sample 

to a control sample; and 

forming a diagnosis based on the amount of DEF protein or mRNA 
expressed in the sample as compared to the control sample. 

35 The invention also encompasses kits for detecting the presence of DEF in 

a biological sample. For example, the kit can comprise a labeled or labelable 
agent capable of detecting DEF protein or mRNA in a biological sample; means 
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for determining the amount of DEF in the sample; and means for comparing the 
amount of DEF in the sample with a standard. The agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to 
detect DEF mRNA or protein. 
5 The diagnostic methods of the present invention are elaborated further 

below. In preferred embodiments, the methods can be characterized as 
comprising detecting, in a sample of cells from the subject, the presence or 
absence of a genetic lesion characterized by at least one of (i) an alteration 
affecting the integrity of a gene encoding a DEF-protein, or (ii) the mis- 

10 expression of the DEF gene. To illustrate, such genetic lesions can be detected 
by ascertaining the existence of at least one of (i) a deletion of one or more 
nucleotides from a DEF gene, (ii) an addition of one or more nucleotides to a 
DEF gene, (iii) a substitution of one or more nucleotides of a DEF gene, (iv) a 
gross chromosomal rearrangement of a DEF gene, (v) a gross alteration in the 

1 5 level of a messenger RNA transcript of a DEF gene, (vii) aberrant modification 
of a DEF gene, such as of the methylation pattern of the genomic DNA, (vii) the 
presence of a non-wild type splicing pattern of a messenger RNA transcript of a 
DEF gene, (viii) a non-wild type level of a DEF-protein, (ix) allelic loss of a 
DEF gene, and (x) inappropriate post-translational modification of a DEF- 

20 protein. As set out below, the present invention provides a large number of assay 
techniques for detecting lesions in a DEF gene, and importantly, provides the 
ability to discern between different molecular causes underlying DEF-dependent 
aberrant bioactivity of a DEF popypeptide. 

In an exemplary embodiment a nucleic acid composition is provided 

25 which contains an oligonucleotide probe previously described. The nucleic acid 
of a cell is rendered accessible for hybridization, the probe is exposed to nucleic 
acid of the sample, and the hybridization of the probe to the sample nucleic acid 
is detected. Such techniques can be used to detect lesions at either the genomic 
or mRNA level, including deletions, substitutions, etc., as well as to determine 

30 mRNA transcript levels. 

In certain embodiments, detection of the lesion comprises utilizing the 
probe/primer in a polymerase chain reaction (PCR) (see, e.g. U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, 
in a ligation chain reaction (LCR) (see, e.g., Landegran et al. (1988) Science 

35 241 :1077-1080; and Nakazawa et al. (1994) PNAS 91:360-364), the latter of 
which can be particularly useful for detecting point mutations in the DEF-gene 
(see Abrayaya et al. (1995) Nuc Acid Res 23:675-682). In a merely illustrative 
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embodiment, the method includes the steps of (i) collecting a sample of cells 
from a patient, (ii) isolating nucleic acid (e.g., genomic, mRNA or both) from the 
cells of the sample, (iii) contacting the nucleic acid sample with one or more 
primers which specifically hybridize to a DEF gene under conditions such that 
5 hybridization and amplification of the DEF-gene (if present) occurs, and (iv) 
detecting the presence or absence of an amplification product, or detecting the 
size of the amplification product and comparing the length to a control sample. 
It is anticipated that PCR and/or LCR may be desirable to use as a preliminary 
amplification step in conjunction with any of the techniques used for detecting 

10 mutations described herein. 

Alternative amplification methods include: self sustained sequence 
replication (Guatelli, J.C. et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874- 
1 878), transcriptional amplification system (Kwoh, D. Y. et al., 1 989, Proc. Natl. 
Acad. Sci. USA 86:1 173-1 177), Q-Beta Replicase (Lizardi, P.M. et al., 1988, 

1 5 Bio/Technology 6: 1 1 97), or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well 
known to those of skill in the art. These detection schemes are especially useful 
for the detection of nucleic acid molecules if such molecules are present in very 
low numbers. 

20 In another embodiment of the subject assay, mutations in a DEF gene 

from a sample cell are identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is isolated, amplified 
(optionally), digested with one or more restriction endonucleases, and fragment 
length sizes are determined by gel electrophoresis. Moreover, the use of 

25 sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) can 
be used to score for the presence of specific mutations by development or loss of 
a ribozyme cleavage site. 

In yet another embodiment, any of a variety of sequencing reactions 
known in the art can be used to directly sequence the DEF gene and detect 

30 mutations by comparing the sequence of the sample DEF with the corresponding 
wild-type (control) sequence. Exemplary sequencing reactions include those 
based on techniques developed by Maxim and Gilbert (Proc. Natl Acad Sci USA 
(1977) 74:560) or Sanger (Sanger et al (1977) Proc. Nat. Acad. Sci 74:5463). 
Any of a variety of automated sequencing procedures may be utilized when 

35 performing the subject assays (Biotechniques (1995) 19:448), including by 
sequencing by mass spectrometry (see, for example PCT publication WO 
94/16101; Cohen et al. (1996) Adv Chromatogr 36:127-162; and Griffin et al. 
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(1993) Appl Biochem Biotechnol 38:147-159). It will be evident to one skilled 
in the art that, for certain embodiments, the occurence of only one, two or three 
of the nucleic acid bases need be determined in the sequencing reaction. For 
instance, A-tract sequencing where only one nucleic acid is detected, can be 
5 carried out. 

In a further embodiment, protection from cleavage agents (such as a 
nuclease, hydroxylamine or osmium tetroxide and with piperidine) can be used 
to detect mismatched bases in RNA/RNA or RNA/DNA heteroduplexes (Myers, 
et al. (1985) Science 230:1242). In general, the art technique of "mismatch 

10 cleavage" starts by providing heteroduplexes formed by hybridizing (labelled) 
RN A or DNA containing the wild-type DEF sequence with potentially mutant 
RNA or DNA obtained from a tissue sample. The double-stranded duplexes are 
treated with an agent which cleaves single-stranded regions of the duplex such as 
which will exist due to basepair mismatches between the control and sample 

15 strands. For instance, RNA/DNA duplexes can be treated with RNase and 
DNA/DNA hybrids treated with SI nuclease to enzymatically digesting the 
mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA 
duplexes can be treated with hydroxylamine or osmium tetroxide and with 
piperidine in order to digest mismatched regions. After digestion of the 

20 mismatched regions, the resulting material is then separated by size on 

denaturing polyacrylamide gels to determine the site of mutation. See, for 
example, Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al 
(1992) Methods Enzymol 217:286-295. In a preferred embodiment, the control 
DNA or RNA can be labeled for detection. 

25 In still another embodiment, the mismatch cleavage reaction employs one 

or more proteins that recognize mismatched base pairs in double-stranded DNA 
(so called "DNA mismatch repair" enzymes) in defined systems for detecting and 
mapping point mutations in DEF cDNAs obtained from samples of cells. For 
example, the mutY enzyme of E. coli cleaves A at G/A mismatches and the 

30 thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu 
et al. (1994) Carcinogenesis 15:1657-1662). According to an exemplary 
embodiment, a probe based on a DEF sequence, e.g., a wild-type DEF sequence, 
is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, 

35 can be detected from electrophoresis protocols or the like. See, for example, 
U.S. Patent No. 5,459,039. 
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In other embodiments, alterations in electrophoretic mobility will be used 
to identify mutations in DEF genes. For example, single strand conformation 
polymorphism (SSCP) may be used to detect differences in electrophoretic 
mobility between mutant and wild type nucleic acids (Orita et aL (1989) Proc 
5 Natl Acad, Sci USA 86:2766, see also Cotton (1993) Mutat Res 285:125-144; 
and Hayashi (1 992) Genet Anal Tech Appl 9:73-79). Single-stranded DNA 
fragments of sample and control DEF nucleic acids will be denatured and 
allowed to renature. The secondary structure of single-stranded nucleic acids 
varies according to sequence, the resulting alteration in electrophoretic mobility 

10 enables the detection of even a single base change. The DNA fragments may be 
labelled or detected with labelled probes. The sensitivity of the assay may be 
enhanced by using RNA (rather than DNA), in which the secondary structure is 
more sensitive to a change in sequence. In a preferred embodiment, the subject 
method utilizes heteroduplex analysis to separate double stranded heteroduplex 

1 5 molecules on the basis of changes in electrophoretic mobility (Keen et al. (1991) 
Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type 
fragments in polyacrylamide gels containing a gradient of denaturant is assayed 
using denaturing gradient gel electrophoresis (DGGE) (Myers et al (1985) 

20 Nature 313 :495). When DGGE is used as the method of analysis, DNA will be 
modified to insure that it does not completely denature, for example by adding a 
GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. In a 
further embodiment, a temperature gradient is used in place of a denaturing agent 
gradient to identify differences in the mobility of control and sample DNA 

25 (Rosenbaum and Reissner (1987) Biophys Chem 265: 12753). 

Examples of other techniques for detecting point mutations include, but 
are not limited to, selective oligonucleotide hybridization, selective 
amplification, or selective primer extension. For example, oligonucleotide 
primers may be prepared in which the known mutation is placed centrally and 

30 then hybridized to target DNA under conditions which permit hybridization only 
if a perfect match is found (Saiki et al. (1986) Nature 324:163); Saiki et al (1989) 
Proc. Natl Acad. Sci USA 86:6230). Such allele speicific oligonucleotide 
hybridization techniques may be used to test one mutation per reaction when 
oligonucleotides are hybridized to PCR amplified target DNA or a number of 

35 different mutations when the oligonucleotides are attached to the hybridizing 
membrane and hybridized with labelled target DNA. 
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Alternatively, allele specific amplification technology which depends on 
selective PCR amplification may be used in conjunction with the instant 
invention. Oligonucleotides used as primers for specific amplification may carry 
the mutation of interest in the center of the molecule (so that amplification 
5 depends on differential hybridization) (Gibbs et al (1989) Nucleic Acids Res. 
17:2437-2448) or at the extreme 3' end of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (Prossner 
(1993) Tibtech 1 1 :238. In addition it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection 

10 (Gasparini et al (1992) Mol. Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for 
amplification (Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at the 3' end of the 5' sequence 
making it possible to detect the presence of a known mutation at a specific site 

15 by looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by 
utilizing pre-packaged diagnostic kits comprising at least one probe nucleic acid 
or antibody reagent described herein, which may be conveniently used, e.g., in 
clinical settings to diagnose patients exhibiting symptoms or family history of a 

20 disease or illness involving a DEF gene. 

Diagnostic procedures may also be performed in situ directly upon tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or 
resections, such that no nucleic acid purification is necessary. Nucleic acid 
reagents may be used as probes and/or primers for such in situ procedures (see, 

25 for example, Nuovo, G J., 1 992, PCR in situ hybridization: protocols and 
applications, Raven Press, NY). 

In addition to methods which focus primarily on the detection of one 
nucleic acid sequence, profiles may also be assessed in such detection schemes. 
Fingerprint profiles may be generated, for example, by utilizing a differential 

30 display procedure, Northern analysis and/or RT-PCR. 

Antibodies directed against wild type or mutant DEF proteins, which are 
discussed, above, may also be used in disease diagnostics and prognostics. Such 
diagnostic methods, may be used to detect abnormalities in the level of DEF 
protein expression, or abnormalities in the structure and/or tissue, cellular, or 

35 subcellular location of DEF protein. Structural differences may include, for 
example, differences in the size, electronegativity, or antigenicity of the mutant 
DEF protein relative to the normal DEF protein. Protein from the tissue or cell 
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type to be analyzed may easily be detected or isolated using techniques which are 
well known to one of skill in the art, including but not limited to western blot 
analysis. For a detailed explanation of methods for carrying out western blot 
analysis, see Sambrook et al; 1989, supra, at Chapter 18. The protein detection 
5 and isolation methods employed herein may also be such as those described in 
Harlow and Lane, for example, (Harlow, E. and Lane, D., 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York), which is incorporated herein by reference in its entirety. 

This can be accomplished, for example, by immunofluorescence 

1 0 techniques employing a fluorescently labeled antibody (see below) coupled with 
light microscopic, flow cytometric, or fluorimetric detection. The antibodies (or 
fragments thereof) useful in the present invention may, additionally, be 
employed histologically, as in immunofluorescence or immunoelectron 
microscopy, for in situ detection of DEF proteins. In situ detection may be 

15 accomplished by removing a histological specimen from a patient, and applying 
thereto a labeled antibody of the present invention. The antibody (or fragment) is 
preferably applied by overlaying the labeled antibody (or fragment) onto a 
biological sample. Through the use of such a procedure, it is possible to 
determine not only the presence of the DEF protein, but also its distribution in 

20 the examined tissue. Using the present invention, one of ordinary skill will 
readily perceive that any of a wide variety of histological methods (such as 
staining procedures) can be modified in order to achieve such in situ detection. 

Often a solid phase support or carrier is used as a support capable of 
binding an antigen or an antibody. Well-known supports or carriers include 

25 glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, 

natural and modified celluloses, polyacrylamides, gabbros, and magnetite. The 
nature of the carrier can be either soluble to some extent or insoluble for the 
purposes of the present invention. The support material may have virtually any 
possible structural configuration so long as the coupled molecule is capable of 

30 binding to an antigen or antibody. Thus, the support configuration may be 

spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the 
external surface of a rod. Alternatively, the surface may be flat such as a sheet, 
test strip, etc. Preferred supports include polystyrene beads. Those skilled in the 
art will know many other suitable carriers for binding antibody or antigen, or will 

35 be able to ascertain the same by use of routine experimentation. 

Moreover, any of the above methods for detecting alterations in a DEF 
gene or gene product can be used to monitor the course of treatment or therapy. 
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IX. Methods of modulating cell differentiation 

In another aspect, this invention features methods for inhibiting the 
proliferation and/or reversing the transformed phenotype of a hyperproliferative 
5 cells by the ectopic expression of DEF, or by contacting the cells with a DEF 
agonist. In general, the method includes a step of contacting pathological 
hyperproliferative cells with an amount of a DEF agonist effective for promoting 
the differentiation of the hyperproliferative cells. Alternatively, the a method of 
ectopic expression of DEF in a hyperproliferative cell is described in Examples 7 

10 and 8. The present method can be performed on cells in culture, e.g., in vitro or 
ex vivo, or can be performed on cells present in an animal subject, e.g., as part of 
an in vivo therapeutic protocol. The therapeutic regimen can be carried out on a 
human or other animal subject 

While the DEF activation can be utilized alone, the subject method can 

15 be combined with other therapeutics, e.g., such as cell cycle inhibitors, agents 
which promote apoptpsis, agents which strengthen the immune response, and/or 
PPARy agonists. 

In one embodiment, the cells to be treated are hyperproliferative cells of 
adipocytic lineage, e.g., arising from adipose or adipose precursor cells. In 

20 certain embodiments, the adipose cells show an aberrant activity of at least one 
process mediated by PPARy. As employed herein, the phrase "processes 
mediated by PPARy" refers to biological, physiological, endocrinological, and 
other bodily processes which are mediated by receptor or receptor combinations 
which are responsive to the PPAR-y-selective prostaglandin or prostaglandin-like 

25 compounds described herein. Such processes include cell differentiation to 

produce lipid-accumulation cells, modulation of blood glucose levels and insulin 
sensitivity, regulation of leptin levels and subsequent feeding levels (for the 
control of satiety and/or appetite), regulation of thermogenesis and fatty acid 
metabolism, regulation of fat levels for the treatment of lipodystrophies, control 

30 of cell differentiation for the treatment of myxoid liposarcomas, regulation of 
triglyceride levels and lipoproteins for the treatment of hyperlipidemia, 
modulation of genes expressed in adipose cells (e.g., leptin, lipoprotein, lipase, 
uncoupling protein, and the like), and the like. 

The term "PPARy" refers to members of the peroxisome proliferator- 

35 activated receptors family which are expressed, inter alia, in adipocytic and 
hematopoietic cells (Braissant, O. et al. Endocrinology 137(1): 354-66), and 
which function as key regulators of differentiation. Contemplated within this 
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definition are variants thereof, as for example, PPARyl and PPARy2 which are 
two isoforms having a different N-terminal generated by alternate splicing of a 
primary RNA transcript (Tontonoz, P. et al. (1994), Genes & Dev. 8:1224^34; 
Zhu et al. (1993) J. Biol Chem. 268: 26817-20). 
5 In other embodiments, the instant method can be carried out to prevent 

the proliferation of an adipose cell tumor. The adipose tumor cells can be of a 
liposarcoma. The term "liposarcoma" is recognized by those skilled in the art 
and refers to a malignant tumor characterized by large anaplastic lipoblasts, 
sometimes with foci of normal fat cells. Exemplary liposarcoma types which are 

1 0 can be treated by the present invention include, but are not limited to, well 

differentiated/dedififerentiated, myxoid/round cell and pleomorphic (reviewed in 
Sreekantaiah, C. et al., (1994) supra). 

Another adipose cell tumor which may be treated by the present method 
include lipomas, e.g., benign fatty tumors usually composed of mature fat cells. 

1 5 Likewise, the method of the present invention can be used in the treatment and/or 
prophylaxis of lipochondromas, lipofibromas and lipogranulomas. 
Lipochondroma are tumors composed of mature lipomatous and cartilaginous 
elements; lipofibromas are lipomas containing areas of fibrosis; and 
lipogranuloma are characterized by nodules of lipoid material associated with 

20 granulomatous inflammation. 

The subject method may also be used to inhibit the proliferation of 
hyperplastic/neoplastic cells of hematopoietic origin, e.g., arising from myeloid, 
lymphoid or erythroid lineages, or precursor cells thereof. 

As used herein, the terms "hyperproliferative" and "neoplastic" are used 
25 interchangeably, and refer to those cells an abnormal state or condition 

characterized by rapid proliferation or neoplasm. The terms are meant to include 
all types of cancerous growths or oncogenic processes, metastatic tissues or 
malignantly transformed cells, tissues, or organs, irrespective of histopathologic 
type or stage of invasiveness. "Pathologic hyperproliferative" cells occur in 
30 disease states characterized by malignant tumor growth. 

The term "adipose cell tumor" refers to all cancers or neoplasias arising 
from cells of adipocyte lineage, e.g., arising from adipose or adipose precursor 
cells. The adipose cell tumors include both common and uncommon, benign and 
malignant lesions, such as lipoma, intramuscular and intermuscular lipoma, 
35 neural fibrolipoma, lipoblastoma, lipomatosis, hibernoma, hemangioma and 

liposarcoma, as well as lesions that may mimic fat-containing soft-tissue masses. 
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The term "carcinoma" is recognized by those skilled in the art and refers 
to malignancies of epithelial or endocrine tissues including respiratory system 
carcinomas, gastrointestinal system carcinomas, genitourinary system 
carcinomas, testicular carcinomas, breast carcinomas, prostatic carcinomas, 
5 endocrine system carcinomas, and melanomas. Exemplary carcinomas include 
those forming from tissue of the cervix, lung, prostate, breast, head and neck, 
colon and ovary. The term also includes carcinosarcomas, e.g., which include 
malignant tumors composed of carcinomatous and sarcomatous tissues. An 
"adenocarcinoma" refers to a carcinoma derived from glandular tissue or in 
1 0 which the tumor cells form recognizable glandular structures. 

The term "sarcoma" is recognized by those skilled in the art and refers to 
malignant tumors of mesenchymal derivation. 

As used herein the term "leukemic cancer" refers to all cancers or 
neoplasias of the hemopoietic and immune systems (blood and lymphatic 

1 5 system). The acute and chronic leukemias, together with the other types of 
tumors of the blood, bone marrow cells (myelomas), and lymph tissue 
(lymphomas), cause about 10% of all cancer deaths and about 50% of all cancer 
deaths in children and adults less than 30 years old. Chronic myelogenous 
leukemia (CML), also known as chronic granulocytic leukemia (CGL), is a 

20 neoplastic disorder of the hematopoietic stem cell. The term "leukemia" is 
recognized by those skilled in the art and refers to a progressive, malignant 
disease of the blood-forming organs, marked by distorted proliferation and 
development of leukocytes and their precursors in the blood and bone marrow. 
For instance, the present invention provides for the treatment of various 

25 myeloid disorders including, but not limited to, acute promyeloid leukemia 
(APML), acute myelogenous leukemia (AML) and chronic myelogenous 
leukemia (CML) (reviewed in Vaickus, L. (1991) Crit Rev. in Oncol./HemotoL 
1 1 :267-97). Lymphoid malignancies which may be treated by the subject 
method include, but are not limited to acute lymphoblastic leukemia (ALL), 

30 which includes B-iineage ALL and T-lineage ALL, chronic lymphocytic 

leukemia (CLL), prolymphocyte leukemia (PLL), hairy cell leukemia (HLL) and 
Waldenstrom's macroglobulinemia (WM). Additional forms of malignant 
lymphomas contemplated by the treatment method of the present invention 
include, but are not limited to, non-Hodgkin's lymphoma and variants thereof, 

35 peripheral T-cell lymphomas, adult T-cell leukemia/lymphoma (ATL), cutaneous 
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T-cell lymphoma (CTCL), large granular lymphocytic leukemia (LGF) and 
Hodgkin's disease. 

The subject method can also be useful in treating malignancies of the 
various organ systems, such as those affecting lung, breast, lymphoid, 
5 gastrointestinal, and genito-urinary tract as well as adenocarcinomas which 
include malignancies such as most colon cancers, renal-cell carcinoma, prostate 
cancer and/or testicular tumors, non-small cell carcinoma of the lung, cancer of 
the small intestine and cancer of the esophagus. According to the general 
paradigm of PPARy involvement in differentiation of transformed cells, 

1 0 exemplary solid tumors that can be treated according to the method of the present 
invention include sarcomas and carcinomas with PPARy-responsive phenotypes, 
such as, but not limited to: fibrosarcoma, myxosarcoma, liposarcoma, 
chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, 
endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 

15 synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, 
colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate 
cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat 
gland carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary 
adenocarcinomas, cystadenocarcinoma, medullary carcinoma, bronchogenic 

20 carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 

choriocarcinoma, seminoma, embryonal carcinoma, Wilms 1 tumor, cervical 
cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, bladder 
carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, 
craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic 

25 neuroma, oligodendroglioma, meningioma, melanoma, neuroblastoma, and 
retinoblastoma. 

In another embodiment, the present methods can be used in vitro to induce and/or 
maintain the differentiation of neural crest cells into glial cells, Schwann cells, 
chromaffin cells, cholinergic sympathetic or parasympatheticneurons, as well as 
30 peptidergic and serotonergic neurons. The DEF protein can be used alone, or can be 
used in combination with other neurotrophic factors which act to more particularly 
enhance a particular differentiation fate of the neuronal progenitor cell. 



35 



In addition to cell culture applicationst and other in vitro uses described above, 
yet another aspect of the present invention concerns the therapeutic application of a DEF 
molecules to enhance survival of neurons and other neuronal cells in both the central 
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nervous system and the peripheral nervous system. The ability of DEF molecules to 
regulate neuronal differentiation during development of the nervous system and also 
presumably in the adult state indicates that certain of the DEF molecules can be 
reasonably expected to facilitate control of adult neurons with regard to maintenance, 
5 functional performance, and aging of normal cells; repair and regeneration processes in 
chemically or mechanically lesioned cells; and prevention of degeneration and premature 
death which result from loss of differentiation in certain pathological conditions. In light 
of this understanding, the present invention specifically contemplates applications of the 
subject method to the treatment of (prevention and/or reduction of the severity of) 

10 neurological conditions deriving from: (i) acute, subacute, or chronic injury to the 
nervous system, including traumatic injury, chemical injury, vasal injury and deficits 
(such as the ischemia resulting from stroke), together with infectious/inflammatory and 
tumor-induced injury; (ii) aging of the nervous system including Alzheimer's disease; 
(iii) chronic neurodegenerative diseases of the nervous system, including Parkinson's 

1 5 disease, Huntington's chorea, amy lotrophic lateral sclerosis and the like, as well as 

spinocerebellar degenerations; and (iv) chronic immunological diseases of the nervous 
system or affecting the nervous system, including multiple sclerosis. 

Many neurological disorders are associated with degeneration of discrete 
populations of neuronal elements and may be treatable with a therapeutic regimen which 

20 includes a hedgehog agonist. For example, Alzheimer's disease is associated with 

deficits in several neurotransmitter systems, both those that project to the neocortex and 
those that reside with the cortex. For instance, the nucleus basalis in patients with 
Alzheimer's disease have been observed to have a profound (75%) loss of neurons 
compared to age-matched controls. Although Alzheimer's disease is by far the most 

25 common form of dementia, several other disorders can produce dementia. Several of 
these are degenerative diseases characterized by the death of neurons in various parts of 
the central nervous system, especially the cerebral cortex. However, some forms of 
dementia are associated with degeneration of the thalmus or the white matter underlying 
the cerebral cortex. Here, the cognitive dysfunction results from the isolation of cortical 

30 areas by the degeneration of efferents and afferents. Huntington's disease involves the 
degeneration of intrastraital and cortical cholinergic neurons and GABAergic neurons. 
Pick's disease is a severe neuronal degeneration in the neocortex of the frontal and 
anterior temporal lobes, sometimes accompanied by death of neurons in the striatum. 
Treatment of patients suffering from such degenerative conditions can include the 

35 application of DEF molecules, or agents which mimic their effects, in order to control, 
for example, differentiation and apoptotic events which give rise to loss of neurons (e.g. 
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to enhance survival of existing neurons) as well as promote differentiation and 
repopulation by progenitor cells in the area affected. In preferred embodiments, a source 
of a DEF agent (DEF agonist) is stereotactically provided within or proximate the area 
of degeneration. 

5 In addition to degenerative-induced dementias, a pharmaceutical preparation of 

one or more of the subject DEF molecules can be applied opportunely in the treatment of 
neurodegenerative disorders which have manifestations of tremors and involuntary 
movements. Parkinson's disease, for example, primarily affects subcortical structures 
and is characterized by degeneration of the nigrostriatal pathway, raphe nuclei, locus 

10 cereleus, and the motor nucleus of vagus. Ballism is typically associated with damage to 
the subthalmic nucleus, often due to acute vascular accident. Also included are 
neurogenic and myopathic diseases which ultimately affect the somatic division of the 
peripheral nervous system and are manifest as neuromuscular disorders. Examples 
include chronic atrophies such as amyotrophic lateral sclerosis, Guillain-Barre syndrome 

15 and chronic peripheral neuropathy, as well as other diseases which can be manifest as 
progressive bulbar palsies or spinal muscular atrophies. The present method is amenable 
to the treatment of disorders of the cerebellum which result in hypotonia or ataxia, such 
as those lesions in the cerebellum which produce disorders in the limbs ipsilateral to the 
lesion. 

20 

This invention is further illustrated by the following examples which 
should not be construed as limiting. The contents of all references, patents and 
published patent applications cited throughout this application are hereby 
incorporated by reference. 

25 

EXAMPLE 1 : Purification of Bovine DEF-1 Protein 

Experimental Procedures 

SH3 binding proteins from bovine brain were purified using a src SH3 
and src SH3SH2 affinity columns. The affinity columns were constructed by 

30 cloning the avian src SH3 or src SH3SH2 domains (amino acids 88-136 and 
88-240, respectively) into the plasmid vector pGEX-2T (Pharmacia) using 
standard PCR techniques. The resulting glutathione-S-transferase src SH3 
domain fusion protein was secured to glutathione-coupled sepharose beads. Lck 
SH3, was constructed in a similar fashion using a murine c-lck gene as the initial 

35 template. The GST-DEF-1 constructs were made by cloning in the appropriate 
blunt-ended, Bgl II fragment into the Sma I site of pGEX-2T. Calf brain lysates 
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were made by homogenization in the presence of hypotonic Lysis buffer (0.25M 
Sucrose, 20 mM Tris pH 8.0,1 mM EDTA,1 mM ,8-mercaptoethanol, 2mM 
PMSF) and passed over the respective columns. Each column was washed once 
in NP40 Lysis buffer, twice in 0.5M LiCl /20 mM Tris pH 8.0, and once with 
5 PBS. Samples were eluted with 1 OmM glutathione in 120mM NaCl/1 OOmM 
Tris 8.0 and passed over an ATP-agarose column (Sigma) or eluted with SDS 
sample buffer and loaded onto a 1 0% SDS/PAGE gel. Samples passed over the 
ATP-agarose column were washed twice with PBS, eluted with SDS sample 
buffer and electrophoresed on a 5% SDS/PAGE gel The gel was electroblotted 

1 0 using PVDF membrane (Biorad) in CAPS buffer and the band corresponding to 
DEF-1 was excised. Following in situ digestion with trypsin (Fernandez et al. 
( 1 994)Analytical Biochemistry 218:11 2-7) the resulting peptide mixture was 
separated by microbore HPLC using a Zorbax C18 1.0 mm by 150 mm 
reverse-phase, column on a Hewlett-Packard 1090 HPLC/1040 diode array 

15 detector. Optimum fractions from the chromatogram were chosen based on 
differential UV absorbance at 205nm, 277nm, and 292nm, peak symmetry and 
resolution. Peaks were further screened for length and homogeneity by 
matrix-assisted laser desorption time-of-flight mass spectrometry on a Finnigan 
Lasermat 200 (Hemel, England) and selected fractions underwent automated 

20 Edman degradation on a Perkin Elmer/Applied Biosystems 494 A, 477A (Foster 
City, CA). Details of strategies for the selection of peptide fractions and their 
microsequencing have been previously described (Lane et al. (1991) Journal of 
Protein Chemistry 10:151-60). Lysates made with NP40 Lysis buffer from 
NIH-3T3 cells expressing pLNSL7 alone (vector) or HA tagged DEF-1 (DEF-1) 

25 were passed over the noted columns and washed as described above. Bound 
proteins were immunoblotted with the anti-HA antibody, 12CA5 (Babco). 
pp60 c *« ycr was detected using the monoclonal antibody "327", a gift from J. 
Brugge. 

To identify novel src SH3 binding proteins, proteins isolated from bovine 
30 brain extracts that bound to a glutathione-S-transferase src SH3 (GST-SRC SH3) 
affinity column were analyzed. Resolution of the associated proteins by 
SDS/PAGE showed several species that bound to the src SH3 but not the GST 
beads alone (Figure 1 A). This included a prominent band of approximately 100 
kD which was subsequendy identified as dynamin (Gout, I. et al. (1993) Cell 
35 75:25-36). Because dynamin also shows affinity for ATP agarose (Scaife et al. 
(1990) Journal of Cell Biology 1 1 1 :3023-33), the ability of the src SH3 
associated proteins to bind to an ATP affinity matrix was determined. This led 
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to the identification of a small number of proteins that bound to both affinity 
columns, including a protein of approximately 140 kD (DEF-1) which showed 
high abundance and good separation relative to the other proteins (Figure IB). 
Therefore, a sufficient quantity of DEF-1 was purified to enable a determination 
5 of its partial amino acid sequence. 

A large-scale preparation from bovine brain was prepared and the 
proteins that bound to both columns were separated by SDS/PAGE and blotted 
to polyvinylidene diflouride membrane resulting in approximately 20 \ig of 
purified protein. The band corresponding to DEF-1 was cut from the filter and 
1 0 sequenced. Following elution of the protein from the filter and digestion with 
endopeptidase, the peptides were separated by HPLC. Six peaks from the HPLC 
column were selected and sequenced. The partial amino acid sequence obtained 
did not correspond with any protein in the Genbank database, suggesting that 
DEF-1 was a previously unidentified src SH3 binding protein. 

15 

EXAMPLE 2 : Cloning of Bovine DEF-1 cDNA 

cDNA cloning using degenerate primers in PCR reactions was performed 
essentially as described (Lee, C.C. et al. (1990) A Guide to Methods and 
Applications (ed. M.A. Innis et al) Academic Press, pp. 46-53. Degenerate 

20 oligonucleotides were designed based on the resultant amino acid sequence of six 
tryptic peptides and used as primers in a series of nested PCR reactions using 
bovine brain mRNA as the initial template. Bovine brain RNA was reverse 
transcribed with the downstream primer "RTCRTTNGTRTCYTC" (SEQ ID 
NO: 1 3). The cDNA from this reaction was used in a PCR reaction with the 

25 same downstream primer and "CAYGTICARAAYGARGARAA" (SEQ ID NO: 
1 4) as the upstream primer. This reaction was used as a template for a 
subsequent PCR reaction using the nested upstream primer, 
"GARGARAAYTAYGCICARGT" (SEQ ID NO: 15) and the downstream 
primer. The product from this reaction was sequenced and subsequently 

30 determined to encode amino acids 92-384 of bovine DEF-1 . 

This PCR product was used to screen a bovine brain random primed 
cDNA library in the vector XZapll (Stratagene) obtained from Dr. Akio 
Yamakawa. This resulted in six unique clones, five of which contained DEF-1 
coding sequences. The sixth appears to be a related gene. A segment of one 

35 clone was used to rescreen the library which resulted in three novel DEF-1 
clones including the remainder of the coding sequence. Positives clones were 
used to isolate eight overlapping clones which resulted in approximately 5300 bp 
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of contiguous sequence. The composite sequence contained an open reading 
frame encoding a protein of 1 129 amino acids. The nucleotide and amino acid 
sequence is shown in Figure 2 (SEQ ID NO: 1) and Figure 3 (SEQ ID NO:2), 
respectively. All six peptides sequenced were found in the predicted translation 
5 product. The DEF-1 cDNA (comprised of clones S9 and R27) with the HA tag, 
"MVYPYDVPDYAG" (SEQ ID NO: 1 6), at the N-terminus was cloned into the 
expression vector," pLNSL7" and transfected into \(/2 cells to obtain infectious 
retroviral supernatants (Marth J.D. et al. ( 1 989) Journal of Immunology 
142:1430-7). 

10 

EXAMPLE 3 : Tissue Expression and Structural Features of Bovine DEF-1 

Northern blot analysis indicated that DEF-1 mRNA is expressed in 
several tissues and cell lines examined. This result suggests that expression of 
DEF is ubiquitous. Expression of DEF-1 mRNA is higher in adipose tissues 

1 5 compared to other tissues, suggesting a role for this molecule in adipogenesis. In 
addition, adipose cells obtained from obese or diabetic mouse models show 
higher levels of expression than normal mice. The pattern of expression of DEF- 
1 mRNA appeals developmentally regulated. For example, the expression of 
DEF-1 mRNA is relatively high in the developing rat brain, and decreases after 

20 birth to levels similar to the ones detected in the adult brain. . 

Genbank database searches of the cloned DEF-1 sequences for related 
protein sequences failed to identify any significant homologies. However, the 
best matches from the data base search indicated that DEF-1 shares several 
motifs with other proteins which are illustrated in Figure 3 and described below 

25 as follows. Comparison of the amino acid sequence of bovine DEF-1 protein 
revealed several motifs including four ankyrin repeats, three of which are in 
close proximity to each other (Figure 3) corresponding to amino acids 356-374, 
604-623, 640-659 and 672-692. Ankyrin is a protein that "anchors" cytoskeleton 
elements to the plasma membrane (Michaely, P. and Bennett, V. (1993) Journal 

30 of Biological Chemistry 268:22703-9). A 33 amino acid motif is repeated 24 
times within ankyrin and this region is believed to be involved in directing the 
protein to the inner face of the plasma membrane (Michaely, P. and Bennett, V. 
(1 993) Journal of Biological Chemistry 268:22703-9). This repeat has been 
found in several other proteins such as the transcription factor regulator, Ik-B 

35 (Hay, 1993). The presence of the ankyrin repeats suggests that DEF-1 may be 
targeted to the plasma membrane. DEF-1 protein also includes a C2 domain 
located approximately at amino acids 498-557. Figure 9A is an alignment of the 
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amino acid sequences of the C2 domain (amino acids 498-557) of bovine DEF-1 
(DEF zinc) with other C2 containing proteins. A comparison of these sequence 
reveals about 27.1% identity with In(l, 3,4,5) binding protein, and 28.3% identity 
with Centaurin. Figure 9B is an alignment of the amino acid sequences of the C2 
5 domain (amino acids 498-557) of bovine DEF-1 (DEF zinc) with other C2 
containing proteins that also contain a zinc finger domain (Cullen, P.J. et al 
(1995) Nature 376: 527). A comparison of these sequence reveals a 16.7, 22,2, 
13.9 and 25% identity with Synaptogemin, In(l,3,4,5) binding protein, human 
IP(1,3,4,5) and Centaurin, respectively. C2 domains are believed to be involved 

1 0 in lipid binding, primarily phosphatidy linositol binding. This finding suggests 
that DEF-1 may interact with a component of the plasma membrane, which may 
in turn regulate DEF-1 activity. 

Bovine DEF-1 also contains a pleckstrin homology (PH) domain located 
approximately at amino acids 326-419. The PH domain is a domain of about 

15 1 00 amino acids located at the carboxy-terminal of several proteins involved in 
signal transduction processes or as constituents of the cytoskeleton (Haslam et 
al. (1993) Nature 363:309-310; Mayer et al. (1993) Cell 73:629-630; Musacchio 
et al. (1993) Trends Biochem. Set 18:343-348). Bovine DEF-1 also contains one 
zinc finger domain located approximately at amino acids 457-480. Several 

20 matches found from the database search shared homology to the zinc finger 
found in ARF1 GTPase activating protein (Trainor, CD. et a/.(1990) Nature 
343:92-96). Interestingly, these proteins bind to different G proteins and are 
believed to affect their GTPase activity. Since it is possible that the G protein 
dynamin copurified with DEF-1 , this shared motif suggests that DEF-1 is also a 

25 modulator of a G protein activity. 

Additionally, DEF-1 contains an SH3 domain located at approximately 
amino acids 1073-1 123. Furthermore, bovine DEF-1 contains several proline 
rich stretches including multiple sre SH3 consensus binding sequences located at 
about amino acids 794-799, 803-809, 829-835, 895-901 and 993-999 (Rickles, 

30 RJ. et al (1995) Proceedings of the National Academy of Sciences of the United 
States of America 92:10909-13; Weng, Z. et al (1995) Molecular & Cellular 
Biology 15:5627-34; Sparks, A.B. et al. (1995) Methods in Enzymology 255:498- 
509; Alexandropoulos K. et al (1995) Proceedings of the National Academy of 
Sciences of the United States of America 92:3 1 10-4). No previously described 

35 motifs that would account for DEF-1 f s affinity for ATP agarose were apparent. 

In addition to the readily identifiable motifs described above, an unusual 
proline-rich stretch located between the SH3 domain and the predicted SH3 
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binding sites in DEF-1 was noted (amino acids 934-1001). This region can be 
subdivided into six tandem repeats centered on the consensus sequence 
"GDLPPKP". Although this motif has the PXXP motif found in SH3 binding 
proteins, it would not be predicted to form a high affinity interaction with src 
5 SH3 since it lacks a basic amino acid residue at the proper position (with the 
exception of the last repeat; Rickles, RJ. et al (1995) Proceedings of the 
National Academy of Sciences of the United States of America 92: 1 0909-1 3. 
However, the preponderance of prolines in this repeat suggests that this region 
forms a polyproline type II helix (Williamson, MP. (1994) BiochemicalJournal 

10 297:249-60). Figure 6B is a schematic of the interaction of a Src SH3 ligand 
binding site and an SH3 domain (adapted from Feng, S. et al. (1 994) Science 
266 : 1241-1247). Based on this assumption, the four C-terminal repeats form a 
trigonal prism with an acidic "edge", a basic edge, and an uncharged edge (with 
the exception noted above; Figures 7A-7B). The two longer repeats (amino acids 

1 5 934-965) have a similar pattern yet the relative charge rotates between the 
repeats. Figures 7A and 7B are schematic representations of the putative 
left-handed polyproline type II helix configuration of bovine DEF-1 proline-rich 
motifs (amino acids 934-1001). Figure 7A represents the putative structure of 
repeats 1-3 (amino acids 934-974). Figure 7B represents the putative structure of 

20 repeats 3-6 (amino acids 966-1001). 

The presence of these six proline repeats is significantly different to any 
SH3 binding sequence reported thus far. In this regard, a motif termed "WW" or 
"WWP" domain (so called because of conserved tryptophans) has been shown to 
associate with proline rich sequences. These proline rich regions tend to lack the 

25 basic amino acid near the proline helix common to SH3 binding proteins. This 
suggests that the C-terminus of DEF-1 could potentially associate with a 
WW/WWP domain containing protein. The repeated motif in DEF-1 does have 
charged amino acids albeit in the improper location for SH3 binding. 

If the repeated motif described above acts as a SH3 binding site then this 

30 is the first reported case where such a motif has been found in such a repetitive 
fashion. Consequently, this sequence may represent an unique opportunity to 
determine what amino acids are crucial for an SID interaction. The other motifs 
described above also suggest where DEF-1 is localized within a cell and how it is 
involved in signal transduction. 



35 
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EXAMPLE 4 : Identification of DEF-1 as an src SH3 binding protein 

To confirm that the DEF-1 cDNA encoded a src SH3 binding protein, the 
full length DEF-1 coding sequence fused with a hemaglutinin tag (HA) at the 
amino terminus was expressed in NIH-3T3 cells. Lysates from the subsequent 
5 drug selected, DEF-1 expressing cells were passed over a src SH3 column and 
probed with an anti-HA antibody. The protein produced by the DEF-1 cDNA 
associated with the src SID beads, which strongly suggests that it encodes the 
protein detected in Figure 1 A. 

Bovine DEF-1 co-purified with dynamin, a protein known to associate 

10 with numerous SH3 domains and ATP agarose (Gout, I. et al. (1993) Cell 75:25- 
36; Scaife, R. and Margolis, R.L. (1990) Journal of Cell Biology 1 1 1 :3023-33). 
Therefore, the interaction between DEF-1 and src SH3 may have been dependent 
upon an intermediary such as dynamin. To provide evidence that DEF-1 
associated with src SH3 directly, two GST fusion proteins spanning regions of 

15 DEF-1 that had SH3 consensus binding sequences were constructed. Lysates 
made from bovine brain or insect cells infected with baculovirus pp60 c ' src were 
passed over the respective columns and the washed beads were immunoblotted 
with an anti- ppeO 0 "^ antibody. pp60 c " 5rc isolated from either lysate 
associated efBciendy with amino acids 777-926 of DEF-1 (Fig. 6). The results 

20 in Figure 6 can be explained by a direct interaction existing between this amino 
acids 777-926 of DEF-1 and the SH3 domain in ppeO 0 "^. Even though amino 
acids 928-1 129 contains a consensus src SH3 binding site, no interaction with 
pp60 c ' src was detected. However, it is not clear if an intramolecular interaction 
with the DEF-1 SH3 domain in this construct might interfere with src SH3 

25 binding. 

EXAMPLE 5 : Binding of DEF proteins to other SH3 containing proteins 

In order to examine the possibility that one repeat of the hexa-motif 
contained in DEF is capable of binding to the SH3 domain of p85, tissue or cell 

30 lysates prepared as described in Example 1 can be passed over the 

GST-pDEFBH beads as described. The precipitate can be examined by Western 
blot using an anti-p85 antibody. If p85 does interact with this region, then the 
other five repeats may reflect the binding site for a different SH3 containing 
protein. Tissue extracts can be precipitated with the GST-pDEFBH and analyzed 

35 by AllPro stain to determine if any proteins specifically associate with this 

region. The identitity of the isolated proteins can be assessed by determining the 
electrophoretic mobility as analyzed by 2D and SDS-PAGE gels. 
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EXAMPLE 6 : Binding of DEF proteins to other SH3 containing proteins 

As described in Example 1 , DEF was purified by its ability to efficiently 
bind to a Src SH3 column. Experiments can be performed to demonstrate that 
5 pl40 binds to Src SH3 in vitro and to map the Src SH3 binding site on DEF. To 
accomplish this, full length DEF can be cloned into a bacterial expression vector 
in order to make a lacZ-pl40 fusion protein. The resultant bacterial lysate will be 
incubated with Src SH3 beads to determine if DEF can be precipitated. In the 
event that expression of DEF may be toxic to bacteria, DEF cDNA can be 
1 0 expressed in a baculo virus expression vector. 

EXAMPLE 7 : Induction of Adipogenesis bv Overexpression of Bovine 
DEF-1 in Fibroblastic Cell Lines 
To determine the phenotype associated with DEF (over)expression, the 

1 5 DEF cDNA was introduced into the fibroblastic cell line Balb/3T3. Briefly, 
Balb/c-3T3 or NIH-3T3 cells were infected with the vector alone or DEF-1 
retroviral supernatants and selected with 400jag/ml G418. Only pools of cells 
derived from more than -1000 infected cells were assayed. Upon confluence, 
the derivative NIH-3T3 cells were cultured in 1 0%FCS/DMEM and 

20 supplemented with combinations of 1 dexamethasone (Sigma), 5 insulin 
(Sigma), and 10 pioglitazone, as indicated (Tontonoz, P. et al. (1994) Cell 
79:11147-56). The medium was changed every other day. After two weeks at 
confluence, a small number of cells expressing exogenous DEF-1 formed shiny 
vacuoles. This moiphology is indicative of lipid droplets found in adipocytes, 

25 which suggests that DEF may be involved in the differentiation of fibroblasts 
into adipocytes. Cell culture conditions and differentiation assays were 
performed as described in Hu, E. et al. (1996) Science 274: 2100-2103. 

The formation of lipid droplets in the DEF-l/Balb/c-3T3 cells prompted 
the study of the role of DEF-1 in adipogenesis using NIH-3T3 cells as a model 

30 system (Cornelius, P. (1 994) Annual Review of Nutrition 14:99-129). A selected 
pool of NIH-3T3 cells infected with the DEF-1 retrovirus (DEF-1/NIH-3T3) kept 
at confluence in 10%FCS/DMEM demonstrated no visible signs of adipogenesis. 
However, parallel cultures supplemented with factors that have been previously 
shown to enhance differentiation in pre-adipocytic cell lines, particularly 

35 dexamethasone, insulin, and the thiazolidinedione, pioglitazone, demonstrated 
considerable levels of lipid accumulation as compared to the vector alone (Cao, 
Z. et al. (1991) Genes & Development Kletzien, R.F. et al. (1992) Molecular 
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Pharmacology 41 :393-8; Forman, B.M. et al. (1995) Cell 83:803-12). Lipid 
droplets turned red when stained with Oil-red-O, which is indicative of adipocyte 
differentiation. Northern blot analysis with the adipocyte specific marker aP2 
confirmed that the cultures of treated DEF-1/NIH-3T3 cells that presented lipid 
5 droplets underwent adipogenesis (Tontonoz P. et al. (1994) Genes & 
Development 8:1224-34; Spiegelman, B.M. et al. Cell 87:377-89). 

EXAMPLE 8 : Cells overexpressing Bovine DEF-1 Show Augmented 
Levels of PPARy 

10 The adipogenic activity seen in the DEF-1/NIH 3T3 ceils was dependent 

upon the presence of pioglitazone, which is a potent and specific stimulator of 
the nuclear receptor PPARy (Lehmann, J.M. et al (1995) Journal of Biological 
Chemistry 272:5367-70). NIH-3T3 cells normally demonstrate no discernible 
phenotypic changes during pioglitazone treatment presumably due to low levels 

15 of PPARy expression (Tontonoz, P. et al. (1994) Cell 79:1 147-56). However, 

ectopic expression of PPARy in NIH/3T3 cells followed by treatment with PPAR 
y activating ligands has been shown to be sufficient to promote conspicuous 
adipogenesis (Forman, B.M. (1995) Cell 83:803-12). 

While assaying for the expression of adipocytic markers in DEF-1 

20 expressing cells, elevated levels of PPARy mRNA in cells that had been treated 
with the complete differentiation cocktail were detected. Since PPARy levels 
increase during adipogenesis, this result suggests that either DEF-1 promotes 
PPARy expression or that augmented PPARy levels are the result of DEF-1 
induced fibroblastic differentiation (Tontonoz, P. et al. (1994) Genes & 

25 Development 8:1224-34). However, the culture of DEF-1/NIH-3T3 cells 
supplemented only with dexamethasone and insulin demonstrated increased 
levels of PPARy mRNA as compared to control cells. This suggests that 
heightened expression of DEF-1 synergizes with the effects of dexamethasone 
and insulin treatment to increase PPARy levels. Further supplementation of 

30 pioglitazone activates the augmented levels of PPARy resulting in the adipogenic 
phenotype. Elevated levels of PPARy mRNA expression were mirrored by 
elevated protein levels of the receptor. 

DEF-1 mRNA expression is found in adipose tissue suggesting that 
DEF-1 may have a role in adipogenesis in vivo. In fact, elevated expression of 

35 DEF-1 mRNA has been identified in obesity mouse models relative to non-obese 
mice, suggesting that DEF-1 may be an inportant regulator of adipocytic 
differentiation in normal and pathological conditions. Thus, strategies for 
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modulating DEF-1 activity may be important in treating disorders involving 
aberrant adipose cell activity such as obesity. 

The relationship between DEF-1 and PPARy expression may extend 
beyond fibroblastic differentiation since both have been detected in several 
5 different tissues (Tontonoz, P. etal. (1994) Cell 79:1 147-56. However, there are 
tissues that express DEF-1 in the absence of detectable levels of PPARy (e.g. 
brain) suggesting a target for DEF-1 other than PPARy in particular cell types. 

EXAMPLE 9 ; DEF-1 Enhances PPARy Activity in Cells Co-Expressing 
10 DEF-1 and PPARy 

To characterize the potential interaction of DEF-1 and PPARy NIH3T3 
cells transfected with PPARy alone, or co-transfected with PPARy and bovine 
DEF-1. Transfection studies were performed as described above. Results were 
characterized based on cell morphology, staining of lipid droplets with oil-red-o, 

1 5 and expression of adipocytic markers. Cells co-transfected with PPARy and 
DEF- 1 compared to cells transfected with PPARy alone showed a greater 
response to the differentiation cocktail, i.e., dexamethasone, insulin and 
pioglitazone, suggesting a synergistic differentiation effect. 

Figure 10 summarizes the quantitation of the level of adipocytic 

20 differentiation in control PPARy-expressing cells (left, solid bar) compared to 
PPARy, DEF-1 -co-expressing cells (right, speckled bar) in the presence of the 
indicated concentrations of pioglitazone. Adipocyte differentiation was detected 
by the expression of the adipocyte marker, AP2 mRNA. A potentiation of the 
pioglitazone-induced differentiation of NIH3T3 cells was observed in DEF-1- 

25 transfected cells relative to the control cells. As shown in Figure 1 0, the 

expression of DEF-1 increases the levels of AP2 mRNA roughly four fold over 
control cells at low levels of pioglitazone. The level of the AP2 mRNA was 
quantitated using a phosphorimager. Thus, if both DEF-1 and PPARy are 
overexpressed in NIH3T3 cells, a similar effect can be seen if the cells are 

30 supplemented with lower levels of pioglitazone than cells expressing PPARy 
only. This results suggests that therapeutic strategies targeting PPARy- 
dependent pathways can be expanded to include modulators of DEF-1 activity or 
expression. 



35 



EXAMPLE 10 : Deletion Analysis of bovine DEF-1 

To localize the domains of bovine DEF-1 necessary for biological 
activity, deletion analysis of the DEF-1 construct was performed. To generate 
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these mutants, full length bovine DEF-1 cDNA was digested with the appropriate 
restriction enzyme (either Apa or Bgl enzymes) to generate two sets of mutants: 
DEF-1 /Apa mutants which encode amino acids 1-800 and DEF-1 /Bgl which 
encode the last 200 amino acids of bovine DEF-1 . Digested fragments were 
5 subcloned into cloned into the expression vector," pLNSL7" and transfected into 
\}/2 cells to obtain infectious retroviral supernatants (Marth, J.D. et al. (1989) 
Journal of Immunology 142:2430-7). Figure 1 1 is a schematic representation of 
deletion mutants of bovine DEF-1. DEF-l/Apa mutants (amino acids 1-800) and 
DEF-l/Bgl mutants (last 200 amino acids of bovine DEF-1 containing the 

1 0 proline-rich repeat and the SH3 domain). 

To assay for the ability of these mutants to induce adipogenesis, 
Balb/c-3T3 or NIH-3T3 cells were transfected as described in Example 7. 
Transfected and control cells were cultured and assayed for adipogenic activity 
as described above. Induction of adipogenesis was observed with the two 

15 constructs tested. However, DEF-l/Bgl mutants showed even higher activity 
than the full length clone, which indicates that the last 200 amino acids of DEF-1 
are sufficient to induce adipogenesis. 

EXAMPLE 11 : Signal Transduction Mechanism of DEF proteins 
20 Preliminary studies indicate that PPARy is a substrate for MAP Kinase 

(MAPK) p42/44 MAPK . When MAPK is active (as it is growing cells), PPARy is 
phosphorylated and its activity is down-regulated. A cbnstitutively active form 
of PPARy can be made by mutating the MAPK phosphorylation site. Therefore, 
DEF may be able to enhance adipogenesis by inhibiting MAPK and indirectly 
25 activating PPARy. 

Preliminary experiments indicate that expression of DEF increases the 
levels of active pSS 1 ^^ in cells as detected by Western blots in NIH3T3 cells 
transfected with DEF relative to the untransfected controls. This result suggests 
that DEF is an upstream effector of p38MAPK and activates a pathway distinct 
30 from PPARy. Therefore, these two pathways may be able to complement each 
other in enhancing the differentiation of fibroblasts. 

EXAMPLE 12; Mechanism of Action of DEF proteins 

Described above is a novel signal transduction molecule, DEF-1 , whose 
35 overexpression in fibroblasts participates in augmentation of PPARy levels and 
induction of cellular differentiation in fibroblasts. The increase in PPARy in 
DEF-1 expressing cells may be a consequence of DEF-1 induced fibroblastic 
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differentiation or may result from DEF-1 signal transduction targeting PPARy 
expression. The latter hypothesis appears more likely since PPARy expression 
was noted in DEF-1/NIH-3T3 cells treated with dexamethasone and insulin in 
the absence of discernible differentiation. 

The mechanism by which dexamethasone and insulin treatment 
synergizes with ectopic expression of DEF-1 in NIH-3T3 cells to augment PPAR 
y levels is unclear at the present. Dexamethasone and insulin have been shown 
to induce or maintain the expression of particular members of the PPAR and 
C/EBP families of transcription factors (Spiegelman, B.M. and Flier, J.S. (1996) 
Cell 87:377-89; Brun, R.P. (1996) Genes & Development 10:974-84; Mandrup, 
S. and Lane, M.D. (1997) Journal of Biological Chemistry 272:5367-70). For 
example, dexamethasone has been shown to induce the expression of C/EBPP 
which cooperates with C/EBPp to promote the synthesis of PPARy in pre- 
adipocytes (Yeh, W.C. et al. (1995) Genes and Development 9:168-81; Wu, Z. et 
aL (1996) Molecular & Cellular Biology 16:4128-36). Elevation of PPARy 
levels in DEF-1 cells may result from the expression of a C/EBP family member 
(such as C/EBPP) or an unknown factor that regulates the amount of PPARy. 
These uncharacterized components may also be affected by dexamethasone since 
constitutive C/EBPp expression does not appear to compensate entirely for 
dexamethasone treatment in the induction of adipogenesis (Wu, Z.N. (1996) 
Molecular & Cellular Biology 16:4128-36). 

DEF-1 has several motifs which suggests that it interacts with other 
presently unidentified proteins to achieve its biological effects and, therefore, 
may act as a "scaffolding" protein (Figures 3, 7 and 8). Potential DEF-1 
associating proteins are likely localized to the cytoplasm since we have several 
lines of evidence (including the purification of DEF-1 using a hypotonic lysis 
buffer) suggesting that DEF-1 has a cytosolic subcellular localization (Figure 
1 A). However, the presence of ankyrin repeats implies that DEF-1 may have at 
least a transient association with the plasma membrane (Michaely, P. and 
Bennet, V. (1993) Journal of Biological Chemistry 268:22703-9). The zinc 
finger or DEF-1 is closely related to several proteins in the database including a 
GTPase activating protein (Trainor, CD. et al (1990) Nature 343:92-6). 
Interestingly, DEF-1 co-purified with the GTPase, dynamin (Figure 1 A). 

The purification of DEF-1 involved a src SH3 affinity column which 
implies pp60 c -* rc is potentially involved in the DEF-1 induced phenotypes 
observed. Although a pp60 c ' 5rc binding site has been mapped to a region of 
DEF-1 containing src SH3 consensus binding sequences (Figure 5), a 
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reproducible interaction between the two full length proteins has not been 
demonstrated. However, this potential interaction may be regulated. The 
presence of both an SH3 domain and SH3 binding sites in DEF-1 suggests that 
these regions are involved in an intramolecular interaction or dimerization 

5 between two DEF-1 molecules. However, the proline rich repeats between these 
two regions (amino acids 934-1001) could act as a rigid "spacer" which likely 
would discourage intramolecular folding. Furthermore, this repetitive motif may 
play a role in DEF-1 homodimerization: this region of DEF-1 can be aligned 
with the identical sequence written in the opposite orientation resulting in almost 

10 every charged amino acid residue being paired with a residue of opposite charge 
(Figure 8). The significance of this charge distribution becomes more evident if 
this region forms a polyproline type II helix and takes on the conformation 
modeled in Figures 7A and 7B. This would enable the polyproline type II 
helices from two DEF-1 molecules to array in a manner where "edges" of 

15 opposite charges align (Figure 8). Altogether, this model of DEF-1 dimerization 
suggests a mechanism whereby the accessibility to the SH3 domain and possibly 
SH3 binding sites within DEF-1 is regulated. 

The proline-rich repeat may also function as a long, rigid structure that 
keeps the two parts of the DEF-1 protein separated. For example, this repeat 

20 prevents the SIB domain of a DEF-1 monomer from interacting with the SH3 
binding sites. This is supported by the feet that the first lysine in the last proline- 
rich repeat is rare for this location, where aliphatic amino acids are typically 
seen. A lysine residue at this location is evolutionary conserved among different 
species such as human and zebrafish, suggesting an inportant function. The 

25 lysine at this position makes the last proline repeat an SID binding consensus 
sequence, therefore, a protein that has an SIC domain might bind at this location. 
In addition, there are signal transduction proteins that have two SH3 domains 
(such as GRB-2). Thus, a protein having two SH3 domains may bind to DEF-1 
using this last repeat. Then, the rest of the proline-rich repeats would provide a 

30 spacer to keep the two SH3 binding sequences at the proper spacing for the target 
protein to bind. 

The ubiquitous expression of DEF-1 implies that DEF-1 signal 
transduction is not restricted to adipogenesis. Moreover, amino acid sequence of 
partial cDNAs corresponding to DEF-1 homologues reveal that DEF-1 has been 
35 extremely well conserved between zebrafish, mice, rats, cows, and humans 

which argues that DEF-1 is a signal transduction component within a variety of 
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species (Yamabhai, M. and Kay, B.K. (1997) Analytical Biochemistry 247:143- 
51). 

EXAMPLE 13; Cloning of Zebrafish DEF Family Members 

Experimental Procedures 

Bovine DEF-1 cDNA Xbal-EcoRI fragment (~4kb) was used as probe to 
screen zebrafish 1 8-hour and 24-hour embryo cDNA libraries in the vector 
ZAPExpress (Stratagene). In this library screen, ~1 x 10 6 plaques were plated, 
transferred to nylon membranes (Genescreen plus, NEN Life Science Products) 
and hybridized at low stringency in 30% formamide at 42°C (Chan and Watt 
(1991) Oncogene 6:1057-1061). The DNA probe was labeled with [a 32 P]- 
dCTP using a random primed labeling kit (Boehringer Mannheim) and washed in 
15 mM sodium chloride, 1.5 mM sodium citrate and 0.1% sodium dodecyl 
sulphate at 42°C. Plaque-purified ZAPEX press phages were automatically 
excised using the helper phage Exassist into the plasmid pBK-CMV 
(Stratagene). Plasmid DNAs were sequenced using the dideoxy method 
following standard protocols. Zebrafish cDNAs encoding full-length DEF 
related proteins, ZDEF-1, ZDEF-2, and ZDEF-3, were analyzed using the DNA 
Star Sequence Analysis Programs. Full-length nucleotide sequences of the 
zebrafish genes are provided herein as follows: DEF-1 gene (Figure 13; SEQ ID 
NO: 3 (coding and untranslated regions); SEQ ID NO: 5 coding sequence only); 
DEF-2 gene (Figure 14; SEQ ID NO: 6 (coding and untranslated regions); SEQ 
ID NO: 8 coding sequence only); and DEF-3 gene (Figure 15; SEQ ID NO: 10 
(coding and untranslated regions); SEQ ID NO: 1 1 coding sequence only). 

An alignment of the amino acid sequences of DEF family members is 
shown in Figure 12. Amino acid sequences corresponding to bovine DEF-1 
(SEQ ID NO: 2); zebrafish DEF-1 (SEQ ID NO: 4); zebrafish DEF-2 (SEQ ID 
NO: 7); zebrafish DEF-3 (SEQ ID NO: 10); and human DEF-2 (SEQ ID NO: 12) 
are indicated. A schematic representation of zebrafish DEF family structure is 
depicted in Figure 16. 

A comparison of the amino acid sequences of the zebrafish family 
members indicated a highly conserved N-terrninal domain of about 750 amino 
acids with higher variation at the C-termini. A comparison of the full length 
sequences between zebrafish DEF-1 and DEF-2 revealed about 55.7% amino 
acid identity, whereas the amino acid sequence identity of the N-terminal 
domains was about 52.2%. A similar comparison between the zebrafish DEF-1 
and DEF-3 sequences revealed about 51% identity of the full length protein, 
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compared to 52.7% identity of the N-terminal domain. Similarly, a 62.3% full 
length identity was found between zebrafish DEF-2 and DEF. 3, compared to 
66% identity between the N-terminal domains. As represented in schematic 
form in Figure 16 and detailed below as Table 1, zebrafish DEF-1 contains the 
same domains as bovine DEF-1 showing: four ankyrin related motifs, one zinc 
finger, SH3 binding sites, a proline-rich repeated motif and an SH3 domain. 
Zebrafish DEF-2 differs from DEF-1 sequence by lacking the proline-rich 
repeated motiff as depicted in Figure 16. Zebrafish DEF-3 which is the shorter 
version of the three DEF proteins lacks the proline-rich repeated motiff and the 
SH3 domain. The approximate amino acid location of these domains is indicated 
in Table 1 below. 



